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PREFACE 


The success of the first seven editions of Intermediate Microeconomics has 
pleased me very much. It has confirmed my belief that the market would 
welcome an analytic approach to microeconomics at the undergraduate 
level. 

My aim in writing the first edition was to present a treatment of the 
methods of microeconomics that would allow students to apply these tools 
on their own and not just passively absorb the predigested cases described 
in the text. I have found that the best way to do this is to emphasize 
the fundamental conceptual foundations of microeconomics and to provide 
concrete examples of their application rather than to attempt to provide 
an encyclopedia of terminology and anecdote. 

A challenge in pursuing this approach arises from the lack of mathemat¬ 
ical prerequisites for economics courses at many colleges and universities. 
The lack of calculus and problem-solving experience in general makes it 
difficult to present some of the analytical methods of economics. How¬ 
ever, it is not impossible. One can go a long way with a few simple facts 
about linear demand functions and supply functions and some elementary 
algebra. It is perfectly possible to be analytical without being excessively 
mathematical. 

The distinction is worth emphasizing. An analytical approach to eco¬ 
nomics is one that uses rigorous, logical reasoning. This does not neces¬ 
sarily require the use of advanced mathematical methods. The language 
of mathematics certainly helps to ensure a rigorous analysis and using it 
is undoubtedly the best way to proceed when possible, but it may not be 
appropriate for all students. 
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Many undergraduate majors in economics are students who should know 
calculus, but don’t—at least, not very well. For this reason I have kept cal¬ 
culus out of the main body of the text. However, I have provided complete 
calculus appendices to many of the chapters. This means that the calculus 
methods are there for the students who can handle them, but they do not 
pose a barrier to understanding for the others. 

I think that this approach manages to convey the idea that calculus is 
not just a footnote to the argument of the text, but is instead a deeper 
way to examine the same issues that one can also explore verbally and 
graphically. Many arguments are much simpler with a little mathematics, 
and all economics students should learn that. In many cases I’ve found 
that with a little motivation, and a few nice economic examples, students 
become quite enthusiastic about looking at things from an analytic per¬ 
spective. 

There are several other innovations in this text. First, the chapters are 
generally very short. I’ve tried to make most of them roughly “lecture 
size,” so that they can be read at one sitting. I have followed the standard 
order of discussing first consumer theory and then producer theory, but 
I’ve spent a bit more time on consumer theory than is normally the case. 
This is not because I think that consumer theory is necessarily the most 
important part of microeconomics; rather, I have found that this is the 
material that students find the most mysterious, so I wanted to provide a 
more detailed treatment of it. 

Second, I’ve tried to put in a lot of examples of how to use the theory 
described here. In most books, students look at a lot of diagrams of shifting 
curves, but they don’t see much algebra, or much calculation of any sort for 
that matter. But it is the algebra that is used to solve problems in practice. 
Graphs can provide insight, but the real power of economic analysis comes 
in calculating quantitative answers to economic problems. Every economics 
student should be able to translate an economic story into an equation or 
a numerical example, but all too often the development of this skill is 
neglected. For this reason I have also provided a workbook that I feel is 
an integral accompaniment to this book. The workbook was written with 
my colleague Theodore Bergstrom, and we have put a lot of effort into 
generating interesting and instructive problems. We think that it provides 
an important aid to the student of microeconomics. 

Third, I believe that the treatment of the topics in this book is more 
accurate than is usually the case in intermediate micro texts. It is true 
that I’ve sometimes chosen special cases to analyze when the general case 
is too difficult, but I’ve tried to be honest about that when I did it. In 
general, I’ve tried to spell out every step of each argument in detail. I 
believe that the discussion I’ve provided is not only more complete and more 
accurate than usual, but this attention to detail also makes the arguments 
easier to understand than the loose discussion presented in many other 
books. 
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There Are Many Paths to Economic Enlightenment 

There is more material in this book than can comfortably be taught in one 
semester, so it is worthwhile picking and choosing carefully the material 
that you want to study. If you start on page 1 and proceed through the 
chapters in order, you will run out of time long before you reach the end 
of the book. The modular structure of the book allows the instructor a 
great deal of freedom in choosing how to present the material, and I hope 
that more people will take advantage of this freedom. The following chart 
illustrates the chapter dependencies. 



The dark colored chapters are “core” chapters—they should probably be 
covered in every intermediate microeconomics course. The light-colored 
chapters are “optional” chapters: I cover some but not all of these every 
semester. The gray chapters are chapters I usually don’t cover in my course, 
but they could easily be covered in other courses. A solid line going from 
Chapter A to Chapter B means that Chapter A should be read before 
chapter B. A broken line means that Chapter B requires knowing some 
material in Chapter A, but doesn’t depend on it in a significant way. 

I generally cover consumer theory and markets and then proceed directly 
to producer theory. Another popular path is to do exchange right after 
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consumer theory; many instructors prefer this route and I have gone to 
some trouble to make sure that this path is possible. 

Some people like to do producer theory before consumer theory. This is 
possible with this text, but if you choose this path, you will need to sup¬ 
plement the textbook treatment. The material on isoquants, for example, 
assumes that the students have already seen indifference curves. 

Much of the material on public goods, externalities, law, and information 
can be introduced earlier in the course. I’ve arranged the material so that 
it is quite easy to put it pretty much wherever you desire. 

Similarly, the material on public goods can be introduced as an illus¬ 
tration of Edgeworth box analysis. Externalities can be introduced right 
after the discussion of cost curves, and topics from the information chapter 
can be introduced almost anywhere after students are familiar with the 
approach of economic analysis. 

Changes for the Eight Edition 

In this edition I have added several new examples involving events, in¬ 
cluding copyright extension, asset price bubbles, counterparty risk, value 
at risk, and carbon taxes. I have continued to offer examples drawn from 
Silicon Valley firms such as Apple, eBay, Google, Yahoo and others. I dis¬ 
cuss topics such as the complementarity between the iPod and iTunes, the 
positive feedback associated with companies such as Facebook, and the ad 
auction models used by Google, Microsoft, and Yahoo. I believe that these 
are fresh and interesting examples of economics in action. 

I’ve also added an extended discussion of mechanism design issues, in¬ 
cluding two-sided matching markets and the Vickrey-Clarke-Groves mech¬ 
anisms. This field, which was once primarily theoretical in nature, has now 
taken on considerable practical importance. 

The Test Bank and Workbook 

The workbook, Workouts in Intermediate Microeconomics, is an integral 
part of the course. It contains hundreds of fill-in-the-blank exercises that 
lead the students through the steps of actually applying the tools they have 
learned in the textbook. In addition to the exercises, Workouts contains a 
collection of short multiple-choice quizzes based on the workbook problems 
in each chapter. Answers to the quizzes are also included in Workouts. 
These quizzes give a quick way for the student to review the material he 
or she has learned by working the problems in the workbook. 

But there is more ... instructors who have adopted Workouts for their 
course can make use of the Test Bank offered with the textbook. The 
Test Bank contains several alternative versions of each Workouts quiz. 
The questions in these quizzes use different numerical values but the same 
internal logic. They can be used to provide additional problems for students 


PREFACE XXIII 


to practice on, or to give quizzes to be taken in class. Grading is quick 
and reliable because the quizzes are multiple choice and can be graded 
electronically. 

In our course, we tell the students to work through all the quiz questions 
for each chapter, either by themselves or with a study group. Then during 
the term we have a short in-class quiz every other week or so, using the 
alternative versions from the Test Bank. These are essentially the Work¬ 
outs quizzes with different numbers. Hence, students who have done their 
homework find it easy to do well on the quizzes. 

We firmly believe that you can’t learn economics without working some 
problems. The quizzes provided in Workouts and in the Test Bank make 
the learning process much easier for both the student and the teacher. 

A hard copy of the Test Bank is available from the publisher, as is the 
textbook’s Instructor’s Manual , which includes my teaching suggestions 
and lecture notes for each chapter of the textbook, and solutions to the 
exercises in Workouts. 

A number of other useful ancillaries are also available with this text¬ 
book. These include a comprehensive set of PowerPoint slides, as well 
as the Norton Economic News Service, which alerts students to economic 
news related to specific material in the textbook. For information on 
these and other ancillaries, please visit the homepage for the book at 
http://www.wwnorton.com/varian. 

The Production of the Book 

The entire book was typeset by the author using Tj^X, the wonderful type¬ 
setting system designed by Donald Knuth. I worked on a Linux system 
and using GNU emacs for editing, res for version control and the T^XLive 
system for processing. I used makeindex for the index, and Trevor Darrell’s 
psf ig software for inserting the diagrams. 

The book design was by Nancy Dale Muldoon, with some modifications 
by Roy Tedoff and the author. Jack Repchek coordinated the whole effort 
in his capacity as editor. 
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CHAPTER 



THE MARKET 


The conventional first chapter of a microeconomics book is a discussion of 
the “scope and methods” of economics. Although this material can be very 
interesting, it hardly seems appropriate to begin your study of economics 
with such material. It is hard to appreciate such a discussion until you 
have seen some examples of economic analysis in action. 

So instead, we will begin this book with an example of economic analysis. 
In this chapter we will examine a model of a particular market, the market 
for apartments. Along the way we will introduce several new ideas and tools 
of economics. Don’t worry if it all goes by rather quickly. This chapter 
is meant only to provide a quick overview of how these ideas can be used. 
Later on we will study them in substantially more detail. 


1.1 Constructing a Model 

Economics proceeds by developing models of social phenomena. By a 
model we mean a simplified representation of reality. The emphasis here 
is on the word “simple.” Think about how useless a map on a one-to-one 
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scale would be. The same is true of an economic model that attempts to de¬ 
scribe every aspect of reality. A model’s power stems from the elimination 
of irrelevant detail, which allows the economist to focus on the essential 
features of the economic reality he or she is attempting to understand. 

Here we are interested in what determines the price of apartments, so 
we want to have a simplified description of the apartment market. There 
is a certain art to choosing the right simplifications in building a model. In 
general we want to adopt the simplest model that is capable of describing 
the economic situation we are examining. We can then add complications 
one at a time, allowing the model to become more complex and, we hope, 
more realistic. 

The particular example we want to consider is the market for apartments 
in a medium-size midwestern college town. In this town there are two 
sorts of apartments. There are some that are adjacent to the university, 
and others that are farther away. The adjacent apartments are generally 
considered to be more desirable by students, since they allow easier access 
to the university. The apartments that are farther away necessitate taking 
a bus, or a long, cold bicycle ride, so most students would prefer a nearby 
apartment ... if they can afford one. 

We will think of the apartments as being located in two large rings sur¬ 
rounding the university. The adjacent apartments are in the inner ring, 
while the rest are located in the outer ring. We will focus exclusively on 
the market for apartments in the inner ring. The outer ring should be inter¬ 
preted as where people can go who don’t find one of the closer apartments. 
We’ll suppose that there are many apartments available in the outer ring, 
and their price is fixed at some known level. We’ll be concerned solely with 
the determination of the price of the inner-ring apartments and who gets 
to live there. 

An economist would describe the distinction between the prices of the two 
kinds of apartments in this model by saying that the price of the outer-ring 
apartments is an exogenous variable, while the price of the inner-ring 
apartments is an endogenous variable. This means that the price of 
the outer-ring apartments is taken as determined by factors not discussed 
in this particular model, while the price of the inner-ring apartments is 
determined by forces described in the model. 

The first simplification that we’ll make in our model is that all apart¬ 
ments are identical in every respect except for location. Thus it will 
make sense to speak of “the price” of apartments, without worrying about 
whether the apartments have one bedroom, or two bedrooms, or whatever. 

But what determines this price? What determines who will live in 
the inner-ring apartments and who will live farther out? What can be 
said about the desirability of different economic mechanisms for allocating 
apartments? What concepts can we use to judge the merit of different 
assignments of apartments to individuals? These are all questions that we 
want our model to address. 
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1.2 Optimization and Equilibrium 

Whenever we try to explain the behavior of human beings we need to have 
a framework on which our analysis can be based. In much of economics we 
use a framework built on the following two simple principles. 

The optimization principle: People try to choose the best patterns of 
consumption that they can afford. 

The equilibrium principle: Prices adjust until the amount that people 
demand of something is equal to the amount that is supplied. 

Let us consider these two principles. The first is almost tautological. If 
people are free to choose their actions, it is reasonable to assume that they 
try to choose things they want rather than things they don’t want. Of 
course there are exceptions to this general principle, but they typically lie 
outside the domain of economic behavior. 

The second notion is a bit more problematic. It is at least conceivable 
that at any given time peoples’ demands and supplies are not compati¬ 
ble, and hence something must be changing. These changes may take a 
long time to work themselves out, and, even worse, they may induce other 
changes that might “destabilize” the whole system. 

This kind of thing can happen ... but it usually doesn’t. In the case 
of apartments, we typically see a fairly stable rental price from month to 
month. It is this equilibrium price that we are interested in, not in how the 
market gets to this equilibrium or how it might change over long periods 
of time. 

It is worth observing that the definition used for equilibrium may be 
different in different models. In the case of the simple market we will 
examine in this chapter, the demand and supply equilibrium idea will be 
adequate for our needs. But in more general models we will need more 
general definitions of equilibrium. Typically, equilibrium will require that 
the economic agents’ actions must be consistent with each other. 

How do we use these two principles to determine the answers to the 
questions we raised above? It is time to introduce some economic concepts. 


1.3 The Demand Curve 

Suppose that we consider all of the possible renters of the apartments and 
ask each of them the maximum amount that he or she would be willing to 
pay to rent one of the apartments. 

Let’s start at the top. There must be someone who is willing to pay 
the highest price. Perhaps this person has a lot of money, perhaps he is 
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very lazy and doesn’t want to walk far ... or whatever. Suppose that this 
person is willing to pay $500 a month for an apartment. 

If there is only one person who is willing to pay $500 a month to rent 
an apartment, then if the price for apartments were $500 a month, exactly 
one apartment would be rented—to the one person who was willing to pay 
that price. 

Suppose that the next highest price that anyone is willing to pay is $490. 
Then if the market price were $499, there would still be only one apartment 
rented: the person who was willing to pay $500 would rent an apartment, 
but the person who was willing to pay $490 wouldn’t. And so it goes. Only 
one apartment would be rented if the price were $498, $497, $496, and so 
on ... until we reach a price of $490. At that price, exactly two apartments 
would be rented: one to the $500 person and one to the $490 person. 

Similarly, two apartments would be rented until we reach the maximum 
price that the person with the third highest price would be willing to pay, 
and so on. 

Economists call a person’s maximum willingness to pay for something 
that person’s reservation price. The reservation price is the highest 
price that a given person will accept and still purchase the good. In other 
words, a person’s reservation price is the price at which he or she is just 
indifferent between purchasing or not purchasing the good. In our example, 
if a person has a reservation price p it means that he or she would be just 
indifferent between living in the inner ring and paying a price p and living 
in the outer ring. 

Thus the number of apartments that will be rented at a given price p* 
will just be the number of people who have a reservation price greater than 
or equal to p*. For if the market price is p*, then everyone who is willing 
to pay at least p* for an apartment will want an apartment in the inner 
ring, and everyone who is not willing to pay p* will choose to live in the 
outer ring. 

We can plot these reservation prices in a diagram as in Figure 1.1. Here 
the price is depicted on the vertical axis and the number of people who are 
willing to pay that price or more is depicted on the horizontal axis. 

Another way to view Figure 1.1 is to think of it as measuring how many 
people would want to rent apartments at any particular price. Such a curve 
is an example of a demand curve —a curve that relates the quantity 
demanded to price. When the market price is above $500, zero apart¬ 
ments will be rented. When the price is between $500 and $490, one 
apartment will be rented. When it is between $490 and the third high¬ 
est reservation price, two apartments will be rented, and so on. The 
demand curve describes the quantity demanded at each of the possible 
prices. 

The demand curve for apartments slopes down: as the price of apart¬ 
ments decreases more people will be willing to rent apartments. If there are 
many people and their reservation prices differ only slightly from person to 
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The demand curve for apartments. The vertical axis mea¬ 
sures the market price and the horizontal axis measures how 
many apartments will be rented at each price. 


Figure 

1.1 


person, it is reasonable to think of the demand curve as sloping smoothly 
downward, as in Figure 1.2. The curve in Figure 1.2 is what the demand 
curve in Figure 1.1 would look like if there were many people who want to 
rent the apartments. The “jumps” shown in Figure 1.1 are now so small 
relative to the size of the market that we can safely ignore them in drawing 
the market demand curve. 


1.4 The Supply Curve 

We now have a nice graphical representation of demand behavior, so let us 
turn to supply behavior. Here we have to think about the nature of the 
market we are examining. The situation we will consider is where there are 
many independent landlords who are each out to rent their apartments for 
the highest price the market will bear. We will refer to this as the case of a 
competitive market. Other sorts of market arrangements are certainly 
possible, and we will examine a few later. 

For now, let’s consider the case where there are many landlords who all 
operate independently. It is clear that if all landlords are trying to do the 
best they can and if the renters are fully informed about the prices the 
landlords charge, then the equilibrium price of all apartments in the inner 
ring must be the same. The argument is not difficult. Suppose instead 
that there is some high price, ph, and some low price, pi, being charged 
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Figure 

1.2 



Demand curve for apartments with many demanders. 

Because of the large number of demanders, the jumps between 
prices will be small, and the demand curve will have the con¬ 
ventional smooth shape. 


for apartments. The people who are renting their apartments for a high 
price could go to a landlord renting for a low price and offer to pay a rent 
somewhere between ph and pi. A transaction at such a price would make 
both the renter and the landlord better off. To the extent that all parties 
are seeking to further their own interests and are aware of the alternative 
prices being charged, a situation with different prices being charged for the 
same good cannot persist in equilibrium. 

But what will this single equilibrium price be? Let us try the method 
that we used in our construction of the demand curve: we will pick a price 
and ask how many apartments will be supplied at that price. 

The answer depends to some degree on the time frame in which we are 
examining the market. If we are considering a time frame of several years, 
so that new construction can take place, the number of apartments will 
certainly respond to the price that is charged. But in the “short run” — 
within a given year, say— the number of apartments is more or less fixed. 
If we consider only this short-run case, the supply of apartments will be 
constant at some predetermined level. 

The supply curve in this market is depicted in Figure 1.3 as a vertical 
line. Whatever price is being charged, the same number of apartments will 
be rented, namely, all the apartments that are available at that time. 
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1.3 


1.5 Market Equilibrium 

We now have a way of representing the demand and the supply side of the 
apartment market. Let us put them together and ask what the equilibrium 
behavior of the market is. We do this by drawing both the demand and 
the supply curve on the same graph in Figure 1.4. 

In this graph we have used p* to denote the price where the quantity 
of apartments demanded equals the quantity supplied. This is the equi¬ 
librium price of apartments. At this price, each consumer who is willing 
to pay at least p* is able to find an apartment to rent, and each landlord 
will be able to rent apartments at the going market price. Neither the con¬ 
sumers nor the landlords have any reason to change their behavior. This 
is why we refer to this as an equilibrium: no change in behavior will be 
observed. 

To better understand this point, let us consider what would happen at 
a price other than p*. For example, consider some price p < p* where 
demand is greater than supply. Can this price persist? At this price at 
least some of the landlords will have more renters than they can handle. 
There will be lines of people hoping to get an apartment at that price; 
there are more people who are willing to pay the price p than there are 
apartments. Certainly some of the landlords would find it in their interest 
to raise the price of the apartments they are offering. 

Similarly, suppose that the price of apartments is some p greater than p*. 
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, is determined by the intersection of the supply and 
curves. 


Then some of the apartments will be vacant: there are fewer people who 
are willing to pay p than there are apartments. Some of the landlords are 
now in danger of getting no rent at all for their apartments. Thus they will 
have an incentive to lower their price in order to attract more renters. 

If the price is above p* there are too few renters; if it is below p* there are 
too many renters. Only at the price of p* is the number of people who are 
willing to rent at that price equal to the number of apartments available 
for rent. Only at that price does demand equal supply. 

At the price p* the landlords’ and the renters’ behaviors are compatible 
in the sense that the number of apartments demanded by the renters at p* 
is equal to the number of apartments supplied by the landlords. This is 
the equilibrium price in the market for apartments. 

Once we’ve determined the market price for the inner-ring apartments, 
we can ask who ends up getting these apartments and who is exiled to the 
farther-away apartments. In our model there is a very simple answer to 
this question: in the market equilibrium everyone who is willing to pay p* 
or more gets an apartment in the inner ring, and everyone who is willing 
to pay less than p* gets one in the outer ring. The person who has a reser¬ 
vation price of p* is just indifferent between taking an apartment in the 
inner ring and taking one in the outer ring. The other people in the inner 
ring are getting their apartments at less than the maximum they would be 
willing to pay for them. Thus the assignment of apartments to renters is 
determined by how much they are willing to pay. 
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1.6 Comparative Statics 

Now that we have an economic model of the apartment market, we can 
begin to use it to analyze the behavior of the equilibrium price. For exam¬ 
ple, we can ask how the price of apartments changes when various aspects 
of the market change. This kind of an exercise is known as compara¬ 
tive statics, because it involves comparing two “static” equilibria without 
worrying about how the market moves from one equilibrium to another. 

The movement from one equilibrium to another can take a substantial 
amount of time, and questions about how such movement takes place can 
be very interesting and important. But we must walk before we can run, 
so we will ignore such dynamic questions for now. Comparative statics 
analysis is only concerned with comparing equilibria, and there will be 
enough questions to answer in this framework for the present. 

Let’s start with a simple case. Suppose that the supply of apartments is 
increased, as in Figure 1.5. 



Increasing the supply of apartments. As the supply of 
apartments increases, the equilibrium price decreases. 


Figure 

1.5 


It is easy to see in this diagram that the equilibrium price of apartments 
will fall. Similarly, if the supply of apartments were reduced the equilibrium 
price would rise. 
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Let’s try a more complicated—and more interesting—example. Suppose 
that a developer decides to turn several of the apartments into condomini¬ 
ums. What will happen to the price of the remaining apartments? 

Your first guess is probably that the price of apartments will go up, 
since the supply has been reduced. But this isn’t necessarily right. It is 
true that the supply of apartments to rent has been reduced. But the de¬ 
mand for apartments has been reduced as well, since some of the people 
who were renting apartments may decide to purchase the new condomini¬ 
ums. 

It is natural to assume that the condominium purchasers come from 
those who already live in the inner-ring apartments—those people who 
are willing to pay more than p* for an apartment. Suppose, for example, 
that the demanders with the 10 highest reservation prices decide to buy 
condos rather than rent apartments. Then the new demand curve is just 
the old demand curve with 10 fewer demanders at each price. Since there 
are also 10 fewer apartments to rent, the new equilibrium price is just 
what it was before, and exactly the same people end up living in the inner- 
ring apartments. This situation is depicted in Figure 1.6. Both the demand 
curve and the supply curve shift left by 10 apartments, and the equilibrium 
price remains unchanged. 


RESERVATION 
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Figure 

1.6 


Effect of creating condominiums. If demand and supply 
both shift left by the same amount the equilibrium price is un¬ 
changed. 
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Most people find this result surprising. They tend to see just the reduc¬ 
tion in the supply of apartments and don’t think about the reduction in 
demand. The case we’ve considered is an extreme one: all of the condo pur¬ 
chasers were former apartment dwellers. But the other case—where none 
of the condo purchasers were apartment dwellers—is even more extreme. 

The model, simple though it is, has led us to an important insight. If we 
want to determine how conversion to condominiums will affect the apart¬ 
ment market, we have to consider not only the effect on the supply of 
apartments but also the effect on the demand for apartments. 

Let’s consider another example of a surprising comparative statics anal¬ 
ysis: the effect of an apartment tax. Suppose that the city council decides 
that there should be a tax on apartments of $50 a year. Thus each landlord 
will have to pay $50 a year to the city for each apartment that he owns. 
What will this do to the price of apartments? 

Most people would think that at least some of the tax would get passed 
along to apartment renters. But, rather surprisingly, that is not the case. 
In fact, the equilibrium price of apartments will remain unchanged! 

In order to verify this, we have to ask what happens to the demand curve 
and the supply curve. The supply curve doesn’t change—there are just as 
many apartments after the tax as before the tax. And the demand curve 
doesn’t change either, since the number of apartments that will be rented 
at each different price will be the same as well. If neither the demand curve 
nor the supply curve shifts, the price can’t change as a result of the tax. 

Here is a way to think about the effect of this tax. Before the tax is 
imposed, each landlord is charging the highest price that he can get that 
will keep his apartments occupied. The equilibrium price p* is the highest 
price that can be charged that is compatible with all of the apartments 
being rented. After the tax is imposed can the landlords raise their prices to 
compensate for the tax? The answer is no: if they could raise the price and 
keep their apartments occupied, they would have already done so. If they 
were charging the maximum price that the market could bear, the landlords 
couldn’t raise their prices any more: none of the tax can get passed along 
to the renters. The landlords have to pay the entire amount of the tax. 

This analysis depends on the assumption that the supply of apartments 
remains fixed. If the number of apartments can vary as the tax changes, 
then the price paid by the renters will typically change. We’ll examine this 
kind of behavior later on, after we’ve built up some more powerful tools 
for analyzing such problems. 


1.7 Other Ways to Allocate Apartments 

In the previous section we described the equilibrium for apartments in 
a competitive market. But this is only one of many ways to allocate a 
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resource; in this section we describe a few other ways. Some of these may 
sound rather strange, but each will illustrate an important economic point. 


The Discriminating Monopolist 

First, let us consider a situation where there is one dominant landlord who 
owns all of the apartments. Or, alternatively, we could think of a number 
of individual landlords getting together and coordinating their actions to 
act as one. A situation where a market is dominated by a single seller of a 
product is known as a monopoly. 

In renting the apartments the landlord could decide to auction them off 
one by one to the highest bidders. Since this means that different people 
would end up paying different prices for apartments, we will call this the 
case of the discriminating monopolist. Let us suppose for simplicity 
that the discriminating monopolist knows each person’s reservation price 
for apartments. (This is not terribly realistic, but it will serve to illustrate 
an important point.) 

This means that he would rent the first apartment to the fellow who 
would pay the most for it, in this case $500. The next apartment would go 
for $490 and so on as we moved down the demand curve. Each apartment 
would be rented to the person who was willing to pay the most for it. 

Here is the interesting feature of the discriminating monopolist: exactly 
the same people will get the apartments as in the case of the market solution, 
namely, everyone who valued an apartment at more than p*. The last 
person to rent an apartment pays the price p* —the same as the equilibrium 
price in a competitive market. The discriminating monopolist’s attempt to 
maximize his own profits leads to the same allocation of apartments as the 
supply and demand mechanism of the competitive market. The amount the 
people pay is different, but who gets the apartments is the same. It turns 
out that this is no accident, but we’ll have to wait until later to explain 
the reason. 


The Ordinary Monopolist 

We assumed that the discriminating monopolist was able to rent each apart¬ 
ment at a different price. But what if he were forced to rent all apartments 
at the same price? In this case the monopolist faces a tradeoff: if he chooses 
a low price he will rent more apartments, but he may end up making less 
money than if he sets a higher price. 

Let us use D(p) to represent the demand function—the number of apart¬ 
ments demanded at price p. Then if the monopolist sets a price p , he will 
rent D(p) apartments and thus receive a revenue of pD(p). The revenue 
that the monopolist receives can be thought of as the area of a box: the 
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height of the box is the price p and the width of the box is the number of 
apartments D(p). The product of the height and the width—the area of 
the box—is the revenue the monopolist receives. This is the box depicted 
in Figure 1.7. 


Supply 


D(p) 


Demand 

NUMBER OF APARTMENTS 


Revenue box. The revenue received by the monopolist is just 
the price times the quantity, which can be interpreted as the 
area of the box illustrated. 


Figure 

1.7 


If the monopolist has no costs associated with renting an apartment, he 
would want to choose a price that has the largest associated revenue box. 
The largest revenue box in Figure 1.7 occurs at the price p. In this case 
the monopolist will find it in his interest not to rent all of the apartments. 
In fact this will generally be the case for a monopolist. The monopolist 
will want to restrict the output available in order to maximize his profit. 
This means that the monopolist will generally want to charge a price that 
is higher than the equilibrium price in a competitive market, p*. In the 
case of the ordinary monopolist, fewer apartments will be rented, and each 
apartment will be rented at a higher price than in the competitive market. 


Rent Control 

A third and final case that we will discuss will be the case of rent control. 
Suppose that the city decides to impose a maximum rent that can be 
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charged for apartments, say p m ax- We suppose that the price p m ax is less 
than the equilibrium price in the competitive market, p*. If this is so we 
would have a situation of excess demand: there are more people who are 
willing to rent apartments at p m ax than there are apartments available. 
Who will end up with the apartments? 

The theory that we have described up until now doesn’t have an answer 
to this question. We can describe what will happen when supply equals 
demand, but we don’t have enough detail in the model to describe what 
will happen if supply doesn’t equal demand. The answer to who gets the 
apartments under rent control depends on who has the most time to spend 
looking around, who knows the current tenants, and so on. All of these 
things are outside the scope of the simple model we’ve developed. It may 
be that exactly the same people get the apartments under rent control as 
under the competitive market. But that is an extremely unlikely outcome. 
It is much more likely that some of the formerly outer-ring people will 
end up in some of the inner-ring apartments and thus displace the people 
who would have been living there under the market system. So under rent 
control the same number of apartments will be rented at the rent-controlled 
price as were rented under the competitive price: they’ll just be rented to 
different people. 


1.8 Which Way Is Best? 

We’ve now described four possible ways of allocating apartments to people: 

• The competitive market. 

• A discriminating monopolist. 

• An ordinary monopolist. 

• Rent control. 

These are four different economic institutions for allocating apartments. 
Each method will result in different people getting apartments or in differ¬ 
ent prices being charged for apartments. We might well ask which economic 
institution is best. But first we have to define “best.” What criteria might 
we use to compare these ways of allocating apartments? 

One thing we can do is to look at the economic positions of the people 
involved. It is pretty obvious that the owners of the apartments end up 
with the most money if they can act as discriminating monopolists: this 
would generate the most revenues for the apartment owner(s). Similarly 
the rent-control solution is probably the worst situation for the apartment 
owners. 

What about the renters? They are probably worse off on average in 
the case of a discriminating monopolist—most of them would be paying a 
higher price than they would under the other ways of allocating apartments. 
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Are the consumers better off in the case of rent control? Some of them are: 
the consumers who end up getting the apartments are better off than they 
would be under the market solution. But the ones who didn’t get the 
apartments are worse off than they would be under the market solution. 

What we need here is a way to look at the economic position of all the 
parties involved—all the renters and all the landlords. How can we examine 
the desirability of different ways to allocate apartments, taking everybody 
into account? What can be used as a criterion for a “good” way to allocate 
apartments taking into account all of the parties involved? 


1.9 Pareto Efficiency 

One useful criterion for comparing the outcomes of different economic insti¬ 
tutions is a concept known as Pareto efficiency or economic efficiency. 1 We 
start with the following definition: if we can find a way to make some people 
better off without making anybody else worse off, we have a Pareto im¬ 
provement. If an allocation allows for a Pareto improvement, it is called 
Pareto inefficient; if an allocation is such that no Pareto improvements 
are possible, it is called Pareto efficient. 

A Pareto inefficient allocation has the undesirable feature that there is 
some way to make somebody better off without hurting anyone else. There 
may be other positive things about the allocation, but the fact that it is 
Pareto inefficient is certainly one strike against it. If there is a way to make 
someone better off without hurting anyone else, why not do it? 

The idea of Pareto efficiency is an important one in economics and we 
will examine it in some detail later on. It has many subtle implications 
that we will have to investigate more slowly, but we can get an inkling of 
what is involved even now. 

Here is a useful way to think about the idea of Pareto efficiency. Sup¬ 
pose that we assigned the renters to the inner- and outer-ring apartments 
randomly, but then allowed them to sublet their apartments to each other. 
Some people who really wanted to live close in might, through bad luck, end 
up with an outer-ring apartment. But then they could sublet an inner-ring 
apartment from someone who was assigned to such an apartment but who 
didn’t value it as highly as the other person. If individuals were assigned 
randomly to apartments, there would generally be some who would want 
to trade apartments, if they were sufficiently compensated for doing so. 

For example, suppose that person A is assigned an apartment in the inner 
ring that he feels is worth $200, and that there is some person B in the outer 
ring who would be willing to pay $300 for A’s apartment. Then there is a 


1 Pareto efficiency is named after the nineteenth-century economist and sociologist 
Vilfredo Pareto (1848-1923) who was one of the first to examine the implications of 
this idea. 
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“gain from trade” if these two agents swap apartments and arrange a side 
payment from B to A of some amount of money between $200 and $300. 
The exact amount of the transaction isn’t important. What is important 
is that the people who are willing to pay the most for the apartments get 
them—otherwise, there would be an incentive for someone who attached a 
low value to an inner-ring apartment to make a trade with someone who 
placed a high value on an inner-ring apartment. 

Suppose that we think of all voluntary trades as being carried out so 
that all gains from trade are exhausted. The resulting allocation must be 
Pareto efficient. If not, there would be some trade that would make two 
people better off without hurting anyone else—but this would contradict 
the assumption that all voluntary trades had been carried out. An alloca¬ 
tion in which all voluntary trades have been carried out is a Pareto efficient 
allocation. 


1.10 Comparing Ways to Allocate Apartments 

The trading process we’ve described above is so general that you wouldn’t 
think that anything much could be said about its outcome. But there is 
one very interesting point that can be made. Let us ask who will end up 
with apartments in an allocation where all of the gains from trade have 
been exhausted. 

To see the answer, just note that anyone who has an apartment in the 
inner ring must have a higher reservation price than anyone who has an 
apartment in the outer ring—otherwise, they could make a trade and make 
both people better off. Thus if there are S apartments to be rented, then 
the S people with the highest reservation prices end up getting apartments 
in the inner ring. This allocation is Pareto efficient—anything else is not, 
since any other assignment of apartments to people would allow for some 
trade that would make at least two of the people better off without hurting 
anyone else. 

Let us try to apply this criterion of Pareto efficiency to the outcomes of 
the various resource allocation devices mentioned above. Let’s start with 
the market mechanism. It is easy to see that the market mechanism assigns 
the people with the S highest reservation prices to the inner ring—namely, 
those people who are willing to pay more than the equilibrium price, p*, 
for their apartments. Thus there are no further gains from trade to be 
had once the apartments have been rented in a competitive market. The 
outcome of the competitive market is Pareto efficient. 

What about the discriminating monopolist? Is that arrangement Pareto 
efficient? To answer this question, simply observe that the discriminat¬ 
ing monopolist assigns apartments to exactly the same people who receive 
apartments in the competitive market. Under each system everyone who is 
willing to pay more than p* for an apartment gets an apartment. Thus the 
discriminating monopolist generates a Pareto efficient outcome as well. 
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Although both the competitive market and the discriminating monop¬ 
olist generate Pareto efficient outcomes in the sense that there will be no 
further trades desired, they can result in quite different distributions of 
income. Certainly the consumers are much worse off under the discrimi¬ 
nating monopolist than under the competitive market, and the landlord(s) 
are much better off. In general, Pareto efficiency doesn’t have much to say 
about distribution of the gains from trade. It is only concerned with the 
efficiency of the trade: whether all of the possible trades have been made. 

What about the ordinary monopolist who is constrained to charge just 
one price? It turns out that this situation is not Pareto efficient. All we 
have to do to verify this is to note that, since all the apartments will not in 
general be rented by the monopolist, he can increase his profits by renting 
an apartment to someone who doesn’t have one at any positive price. There 
is some price at which both the monopolist and the renter must be better 
off. As long as the monopolist doesn’t change the price that anybody else 
pays, the other renters are just as well off as they were before. Thus we 
have found a Pareto improvement —a way to make two parties better 
off without making anyone else worse off. 

The final case is that of rent control. This also turns out not to be Pareto 
efficient. The argument here rests on the fact that an arbitrary assignment 
of renters to apartments will generally involve someone living in the inner 
ring (say Mr. In) who is willing to pay less for an apartment than someone 
living in the outer ring (say Ms. Out). Suppose that Mr. In’s reservation 
price is $300 and Ms. Out’s reservation price is $500. 

We need to find a Pareto improvement—a way to make Mr. In and 
Ms. Out better off without hurting anyone else. But there is an easy way 
to do this: just let Mr. In sublet his apartment to Ms. Out. It is worth $500 
to Ms. Out to live close to the university, but it is only worth $300 to Mr. In. 
If Ms. Out pays Mr. In $400, say, and trades apartments, they will both be 
better off: Ms. Out will get an apartment that she values at more than $400, 
and Mr. In will get $400 that he values more than an inner-ring apartment. 

This example shows that the rent-controlled market will generally not 
result in a Pareto efficient allocation, since there will still be some trades 
that could be carried out after the market has operated. As long as some 
people get inner-ring apartments who value them less highly than people 
who don’t get them, there will be gains to be had from trade. 


1.11 Equilibrium in the Long Run 

We have analyzed the equilibrium pricing of apartments in the short run 
when there is a fixed supply of apartments. But in the long run the supply 
of apartments can change. Just as the demand curve measures the number 
of apartments that will be demanded at different prices, the supply curve 
measures the number of apartments that will be supplied at different prices. 
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The final determination of the market price for apartments will depend on 
the interaction of supply and demand. 

And what is it that determines the supply behavior? In general, the 
number of new apartments that will be supplied by the private market will 
depend on how profitable it is to provide apartments, which depends, in 
part, on the price that landlords can charge for apartments. In order to 
analyze the behavior of the apartment market in the long run, we have 
to examine the behavior of suppliers as well as demanders, a task we will 
eventually undertake. 

When supply is variable, we can ask questions not only about who gets 
the apartments, but about how many will be provided by various types of 
market institutions. Will a monopolist supply more or fewer apartments 
than a competitive market? Will rent control increase or decrease the equi¬ 
librium number of apartments? Which institutions will provide a Pareto 
efficient number of apartments? In order to answer these and similar ques¬ 
tions we must develop more systematic and powerful tools for economic 
analysis. 


Summary 

1. Economics proceeds by making models of social phenomena, which are 
simplified representations of reality. 

2. In this task, economists are guided by the optimization principle, which 
states that people typically try to choose what’s best for them, and by the 
equilibrium principle, which says that prices will adjust until demand and 
supply are equal. 

3. The demand curve measures how much people wish to demand at each 
price, and the supply curve measures how much people wish to supply at 
each price. An equilibrium price is one where the amount demanded equals 
the amount supplied. 

4. The study of how the equilibrium price and quantity change when the 
underlying conditions change is known as comparative statics. 

5. An economic situation is Pareto efficient if there is no way to make some 
group of people better off without making some other group of people worse 
off. The concept of Pareto efficiency can be used to evaluate different ways 
of allocating resources. 
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1. Suppose that there were 25 people who had a reservation price of $500, 
and the 26th person had a reservation price of $200. What would the 
demand curve look like? 

2. In the above example, what would the equilibrium price be if there were 
24 apartments to rent? What if there were 26 apartments to rent? What 
if there were 25 apartments to rent? 

3. If people have different reservation prices, why does the market demand 
curve slope down? 

4. In the text we assumed that the condominium purchasers came from 
the inner-ring people—people who were already renting apartments. What 
would happen to the price of inner-ring apartments if all of the condo¬ 
minium purchasers were outer-ring people—the people who were not cur¬ 
rently renting apartments in the inner ring? 

5. Suppose now that the condominium purchasers were all inner-ring peo¬ 
ple, but that each condominium was constructed from two apartments. 
What would happen to the price of apartments? 

6. What do you suppose the effect of a tax would be on the number of 
apartments that would be built in the long run? 

7. Suppose the demand curve is D(p) = 100 — 2 p. What price would the 
monopolist set if he had 60 apartments? How many would he rent? What 
price would he set if he had 40 apartments? How many would he rent? 

8. If our model of rent control allowed for unrestricted subletting, who 
would end up getting apartments in the inner circle? Would the outcome 
be Pareto efficient? 


CHAPTER 



BUDGET 

CONSTRAINT 


The economic theory of the consumer is very simple: economists assume 
that consumers choose the best bundle of goods they can afford. To give 
content to this theory, we have to describe more precisely what we mean by 
“best” and what we mean by “can afford.” In this chapter we will examine 
how to describe what a consumer can afford; the next chapter will focus on 
the concept of how the consumer determines what is best. We will then be 
able to undertake a detailed study of the implications of this simple model 
of consumer behavior. 


2.1 The Budget Constraint 

We begin by examining the concept of the budget constraint. Suppose 
that there is some set of goods from which the consumer can choose. In 
real life there are many goods to consume, but for our purposes it is conve¬ 
nient to consider only the case of two goods, since we can then depict the 
consumer’s choice behavior graphically. 

We will indicate the consumer’s consumption bundle by (xi, a^)- This 
is simply a list of two numbers that tells us how much the consumer is choos¬ 
ing to consume of good 1, xi, and how much the consumer is choosing to 
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consume of good 2, X 2 ■ Sometimes it is convenient to denote the consumer’s 
bundle by a single symbol like X , where X is simply an abbreviation for 
the list of two numbers (xi,X 2 )- 

We suppose that we can observe the prices of the two goods, (pi,P 2 ), 
and the amount of money the consumer has to spend, m. Then the budget 
constraint of the consumer can be written as 


P1X1 +P2X2 < m. ( 2 . 1 ) 

Here P\X\ is the amount of money the consumer is spending on good 1, 
and P 2 X 2 is the amount of money the consumer is spending on good 2. 
The budget constraint of the consumer requires that the amount of money 
spent on the two goods be no more than the total amount the consumer has 
to spend. The consumer’s affordable consumption bundles are those that 
don’t cost any more than m. We call this set of affordable consumption 
bundles at prices (pi,P 2 ) and income m the budget set of the consumer. 


2.2 Two Goods Are Often Enough 

The two-good assumption is more general than you might think at first, 
since we can often interpret one of the goods as representing everything 
else the consumer might want to consume. 

For example, if we are interested in studying a consumer’s demand for 
milk, we might let X\ measure his or her consumption of milk in quarts per 
month. We can then let X 2 stand for everything else the consumer might 
want to consume. 

When we adopt this interpretation, it is convenient to think of good 2 
as being the dollars that the consumer can use to spend on other goods. 
Under this interpretation the price of good 2 will automatically be 1, since 
the price of one dollar is one dollar. Thus the budget constraint will take 
the form 

P1X1 + X2 < to . ( 2 - 2 ) 

This expression simply says that the amount of money spent on good 1, 
p iii, plus the amount of money spent on all other goods, X 2 , must be no 
more than the total amount of money the consumer has to spend, m. 

We say that good 2 represents a composite good that stands for ev¬ 
erything else that the consumer might want to consume other than good 
1. Such a composite good is invariably measured in dollars to be spent 
on goods other than good 1. As far as the algebraic form of the budget 
constraint is concerned, equation (2.2) is just a special case of the formula 
given in equation (2.1), with P 2 = 1, so everything that we have to say 
about the budget constraint in general will hold under the composite-good 
interpretation. 
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Figure 

2.1 


2.3 Properties of the Budget Set 

The budget line is the set of bundles that cost exactly m: 


PiX\+p2X2=m. (2.3) 

These are the bundles of goods that just exhaust the consumer’s income. 

The budget set is depicted in Figure 2.1. The heavy line is the budget 
line—the bundles that cost exactly m —and the bundles below this line are 
those that cost strictly less than m. 



The budget set. The budget set consists of all bundles that 
are affordable at the given prices and income. 


We can rearrange the budget line in equation (2.3) to give us the formula 


m pi 

x 2 = - Xi- 

P2 P2 


(2.4) 


This is the formula for a straight line with a vertical intercept of m/p 2 
and a slope of —p\ /P 2 ■ The formula tells us how many units of good 2 the 
consumer needs to consume in order to just satisfy the budget constraint 
if she is consuming X\ units of good 1. 
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Here is an easy way to draw a budget line given prices (pi,p 2 ) and income 
m. Just ask yourself how much of good 2 the consumer could buy if she 
spent all of her money on good 2. The answer is, of course, ?n/p 2 . Then 
ask how much of good 1 the consumer could buy if she spent all of her 
money on good 1. The answer is m/p\. Thus the horizontal and vertical 
intercepts measure how much the consumer could get if she spent all of her 
money on goods 1 and 2, respectively. In order to depict the budget line 
just plot these two points on the appropriate axes of the graph and connect 
them with a straight line. 

The slope of the budget line has a nice economic interpretation. It mea¬ 
sures the rate at which the market is willing to “substitute” good 1 for 
good 2. Suppose for example that the consumer is going to increase her 
consumption of good 1 by Ai’i. 1 How much will her consumption of good 
2 have to change in order to satisfy her budget constraint? Let us use Aa; 2 
to indicate her change in the consumption of good 2. 

Now note that if she satisfies her budget constraint before and after 
making the change she must satisfy 


Pixi + P2X2 = m 


and 


Pi{x\ + Aaq) T P2 (x 2 + Ax 2 ) = m. 


Subtracting the first equation from the second gives 


PiA^i +P 2 Ax 2 = 0. 

This says that the total value of the change in her consumption must be 
zero. Solving for Aa^/Aaq, the rate at which good 2 can be substituted 
for good 1 while still satisfying the budget constraint, gives 

Ax 2 _ _Pi 

Aaq p 2 ' 

This is just the slope of the budget line. The negative sign is there since 
Aaq and Aa: 2 must always have opposite signs. If you consume more of 
good 1, you have to consume less of good 2 and vice versa if you continue 
to satisfy the budget constraint. 

Economists sometimes say that the slope of the budget line measures 
the opportunity cost of consuming good 1. In order to consume more of 
good 1 you have to give up some consumption of good 2. Giving up the 
opportunity to consume good 2 is the true economic cost of more good 1 
consumption; and that cost is measured by the slope of the budget line. 

1 The Greek letter A, delta, is pronounced “del-ta.” The notation Aaq denotes the 
change in good 1. For more on changes and rates of changes, see the Mathematical 
Appendix. 
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2.4 How the Budget Line Changes 

When prices and incomes change, the set of goods that a consumer can 
afford changes as well. How do these changes affect the budget set? 

Let us first consider changes in income. It is easy to see from equation 
(2.4) that an increase in income will increase the vertical intercept and not 
affect the slope of the line. Thus an increase in income will result in a par¬ 
allel shift outward of the budget line as in Figure 2.2. Similarly, a decrease 
in income will cause a parallel shift inward. 


Figure 

2.2 


*2 

m'/pi 

Budget lines 

m/Pi 

Slope = -p/p 2 


m /p, m'/pi x 1 

Increasing income. An increase in income causes a parallel 
shift outward of the budget line. 


What about changes in prices? Let us first consider increasing price 
1 while holding price 2 and income fixed. According to equation (2.4), 
increasing p\ will not change the vertical intercept, but it will make the 
budget line steeper since P 1 /P 2 will become larger. 

Another way to see how the budget line changes is to use the trick de¬ 
scribed earlier for drawing the budget line. If you are spending all of 
your money on good 2, then increasing the price of good 1 doesn’t change 
the maximum amount of good 2 you could buy— thus the vertical inter¬ 
cept of the budget line doesn’t change. But if you are spending all of 
your money on good 1, and good 1 becomes more expensive, then your 
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consumption of good 1 must decrease. Thus the horizontal intercept of 
the budget line must shift inward, resulting in the tilt depicted in Fig¬ 
ure 2.3. 



Increasing price. If good 1 becomes more expensive, the 
budget line becomes steeper. 


Figure 

2.3 


What happens to the budget line when we change the prices of good 1 
and good 2 at the same time? Suppose for example that we double the 
prices of both goods 1 and 2. In this case both the horizontal and vertical 
intercepts shift inward by a factor of one-half, and therefore the budget 
line shifts inward by one-half as well. Multiplying both prices by two is 
just like dividing income by 2. 

We can also see this algebraically. Suppose our original budget line is 


PiXi +P 2 X 2 = m. 


Now suppose that both prices become t times as large. Multiplying both 
prices by t yields 

tpiXi + tp 2 x 2 = m. 

But this equation is the same as 


TO 

P1X1 +P2X2 = —■ 

Thus multiplying both prices by a constant amount t is just like dividing 
income by the same constant t. It follows that if we multiply both prices 
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by t and we multiply income by t , then the budget line won’t change at 
all. 

We can also consider price and income changes together. What happens 
if both prices go up and income goes down? Think about what happens to 
the horizontal and vertical intercepts. If m decreases and pi and P 2 both 
increase, then the intercepts m/pi and m/p 2 must both decrease. This 
means that the budget line will shift inward. What about the slope of 
the budget line? If price 2 increases more than price 1, so that —pi/p 2 
decreases (in absolute value), then the budget line will be flatter; if price 2 
increases less than price 1, the budget line will be steeper. 


2.5 The Numeraire 

The budget line is defined by two prices and one income, but one of these 
variables is redundant. We could peg one of the prices, or the income, to 
some fixed value, and adjust the other variables so as to describe exactly 
the same budget set. Thus the budget line 


Pixi + P2X2 = m 


is exactly the same budget line as 


Pi m 

—Xi + X2 = — 
P2 P2 


or 


Pi , P2 n 

— X\ H- X2 = 1, 

m m 


since the first budget line results from dividing everything by p 2 1 and the 
second budget line results from dividing everything by m. In the first case, 
we have pegged P 2 = 1, and in the second case, we have pegged m = 1. 
Pegging the price of one of the goods or income to 1 and adjusting the 
other price and income appropriately doesn’t change the budget set at all. 

When we set one of the prices to 1, as we did above, we often refer to that 
price as the numeraire price. The numeraire price is the price relative to 
which we are measuring the other price and income. It will occasionally be 
convenient to think of one of the goods as being a numeraire good, since 
there will then be one less price to worry about. 


2.6 Taxes, Subsidies, and Rationing 

Economic policy often uses tools that affect a consumer’s budget constraint, 
such as taxes. For example, if the government imposes a quantity tax, this 
means that the consumer has to pay a certain amount to the government 
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for each unit of the good he purchases. In the U.S., for example, we pay 
about 15 cents a gallon as a federal gasoline tax. 

How does a quantity tax affect the budget line of a consumer? From 
the viewpoint of the consumer the tax is just like a higher price. Thus a 
quantity tax of t dollars per unit of good 1 simply changes the price of good 

1 from pi to pi + 1. As we’ve seen above, this implies that the budget line 
must get steeper. 

Another kind of tax is a value tax. As the name implies this is a tax 
on the value—the price—of a good, rather than the quantity purchased of 
a good. A value tax is usually expressed in percentage terms. Most states 
in the U.S. have sales taxes. If the sales tax is 6 percent, then a good that 
is priced at $1 will actually sell for $1.06. (Value taxes are also known as 
ad valorem taxes.) 

If good 1 has a price of pi but is subject to a sales tax at rate r, then 
the actual price facing the consumer is (1 + r)pi? The consumer has to 
pay pi to the supplier and rpi to the government for each unit of the good 
so the total cost of the good to the consumer is (1 + r)pi. 

A subsidy is the opposite of a tax. In the case of a quantity subsidy, 
the government gives an amount to the consumer that depends on the 
amount of the good purchased. If, for example, the consumption of milk 
were subsidized, the government would pay some amount of money to each 
consumer of milk depending on the amount that consumer purchased. If 
the subsidy is s dollars per unit of consumption of good 1, then from the 
viewpoint of the consumer, the price of good 1 would be pi — s. This would 
therefore make the budget line flatter. 

Similarly an ad valorem subsidy is a subsidy based on the price of the 
good being subsidized. If the government gives you back $1 for every $2 
you donate to charity, then your donations to charity are being subsidized 
at a rate of 50 percent. In general, if the price of good 1 is pi and good 1 is 
subject to an ad valorem subsidy at rate <r, then the actual price of good 1 
facing the consumer is (1 — cr)pi. 

You can see that taxes and subsidies affect prices in exactly the same 
way except for the algebraic sign: a tax increases the price to the consumer, 
and a subsidy decreases it. 

Another kind of tax or subsidy that the government might use is a lump¬ 
sum tax or subsidy. In the case of a tax, this means that the government 
takes away some fixed amount of money, regardless of the individual’s be¬ 
havior. Thus a lump-sum tax means that the budget line of a consumer 
will shift inward because his money income has been reduced. Similarly, a 
lump-sum subsidy means that the budget line will shift outward. Quantity 
taxes and value taxes tilt the budget line one way or the other depending 

2 The Greek letter r, tau, rhymes with “wow.” 

3 The Greek letter a is pronounced “sig-ma.” 
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on which good is being taxed, but a lump-sum tax shifts the budget line 
inward. 

Governments also sometimes impose rationing constraints. This means 
that the level of consumption of some good is fixed to be no larger than 
some amount. For example, during World War II the U.S. government 
rationed certain foods like butter and meat. 

Suppose, for example, that good 1 were rationed so that no more than 
tS\ could be consumed by a given consumer. Then the budget set of the 
consumer would look like that depicted in Figure 2.4: it would be the old 
budget set with a piece lopped off. The lopped-off piece consists of all the 
consumption bundles that are affordable but have aq > aq. 


Figure 

2.4 



Budget set with rationing. If good 1 is rationed, the section 
of the budget set beyond the rationed quantity will be lopped 
off. 


Sometimes taxes, subsidies, and rationing are combined. For example, 
we could consider a situation where a consumer could consume good 1 
at a price of p\ up to some level 5q, and then had to pay a tax t on all 
consumption in excess of aq . The budget set for this consumer is depicted 
in Figure 2.5. Here the budget line has a slope of —P 1 /P 2 to the left of aq, 
and a slope of — (p± + t)/p 2 to the right of aq. 
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Taxing consumption greater than x\. In this budget set 
the consumer must pay a tax only on the consumption of good 
1 that is in excess of x\, so the budget line becomes steeper to 
the right of X\. 


Figure 

2.5 


EXAMPLE: The Food Stamp Program 

Since the Food Stamp Act of 1964 the U.S. federal government has provided 
a subsidy on food for poor people. The details of this program have been 
adjusted several times. Here we will describe the economic effects of one 
of these adjustments. 

Before 1979, households who met certain eligibility requirements were 
allowed to purchase food stamps, which could then be used to purchase food 
at retail outlets. In January 1975, for example, a family of four could receive 
a maximum monthly allotment of $153 in food coupons by participating in 
the program. 

The price of these coupons to the household depended on the household 
income. A family of four with an adjusted monthly income of $300 paid 
$83 for the full monthly allotment of food stamps. If a family of four had 
a monthly income of $100, the cost for the full monthly allotment would 
have been $25. 4 

The pre-1979 Food Stamp program was an ad valorem subsidy on food. 
The rate at which food was subsidized depended on the household income. 


4 


These figures are taken from Kenneth Clarkson, Food Stamps and Nutrition , Ameri¬ 
can Enterprise Institute, 1975. 
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The family of four that was charged $83 for their allotment paid $1 to 
receive $1.84 worth of food (1.84 equals 153 divided by 83). Similarly, the 
household that paid $25 was paying $1 to receive $6.12 worth of food (6.12 
equals 153 divided by 25). 

The way that the Food Stamp program affected the budget set of a 
household is depicted in Figure 2.6A. Here we have measured the amount 
of money spent on food on the horizontal axis and expenditures on all other 
goods on the vertical axis. Since we are measuring each good in terms of 
the money spent on it, the “price” of each good is automatically 1, and the 
budget line will therefore have a slope of —1. 

If the household is allowed to buy $153 of food stamps for $25, then this 
represents roughly an 84 percent (= 1 — 25/153) subsidy of food purchases, 
so the budget line will have a slope of roughly —.16 (= 25/153) until the 
household has spent $153 on food. Each dollar that the household spends 
on food up to $153 would reduce its consumption of other goods by about 
16 cents. After the household spends $153 on food, the budget line facing 
it would again have a slope of —1. 


Figure 

2.6 


OTHER 

GOODS 




Food stamps. How the budget line is affected by the Food 
Stamp program. Part A shows the pre-1979 program and part 
B the post-1979 program. 


These effects lead to the kind of “kink” depicted in Figure 2.6. House¬ 
holds with higher incomes had to pay more for their allotment of food 
stamps. Thus the slope of the budget line would become steeper as house¬ 
hold income increased. 

In 1979 the Food Stamp program was modified. Instead of requiring that 
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households purchase food stamps, they are now simply given to qualified 
households. Figure 2.6B shows how this affects the budget set. 

Suppose that a household now receives a grant of $200 of food stamps a 
month. Then this means that the household can consume $200 more food 
per month, regardless of how much it is spending on other goods, which 
implies that the budget line will shift to the right by $200. The slope 
will not change: $1 less spent on food would mean $1 more to spend on 
other things. But since the household cannot legally sell food stamps, the 
maximum amount that it can spend on other goods does not change. The 
Food Stamp program is effectively a lump-sum subsidy, except for the fact 
that the food stamps can’t be sold. 


2.7 Budget Line Changes 

In the next chapter we will analyze how the consumer chooses an optimal 
consumption bundle from his or her budget set. But we can already state 
some observations here that follow from what we have learned about the 
movements of the budget line. 

First, we can observe that since the budget set doesn’t change when we 
multiply all prices and income by a positive number, the optimal choice of 
the consumer from the budget set can’t change either. Without even ana¬ 
lyzing the choice process itself, we have derived an important conclusion: 
a perfectly balanced inflation—one in which all prices and all incomes rise 
at the same rate—doesn’t change anybody’s budget set, and thus cannot 
change anybody’s optimal choice. 

Second, we can make some statements about how well-off the consumer 
can be at different prices and incomes. Suppose that the consumer’s income 
increases and all prices remain the same. We know that this represents a 
parallel shift outward of the budget line. Thus every bundle the consumer 
was consuming at the lower income is also a possible choice at the higher 
income. But then the consumer must be at least as well-off at the higher 
income as at the lower income—since he or she has the same choices avail¬ 
able as before plus some more. Similarly, if one price declines and all others 
stay the same, the consumer must be at least as well-off. This simple ob¬ 
servation will be of considerable use later on. 


Summary 

1. The budget set consists of all bundles of goods that the consumer can 
afford at given prices and income. We will typically assume that there are 
only two goods, but this assumption is more general than it seems. 

2. The budget line is written as p\X\ +P2X2 = m. It has a slope of —p±/p2, 
a vertical intercept of m/p 2 , and a horizontal intercept of m/p\. 
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3. Increasing income shifts the budget line outward. Increasing the price 
of good 1 makes the budget line steeper. Increasing the price of good 2 
makes the budget line flatter. 

4. Taxes, subsidies, and rationing change the slope and position of the 
budget line by changing the prices paid by the consumer. 

REVIEW QUESTIONS 


1. Originally the consumer faces the budget line piX\ + P 2 X 2 = m. Then 
the price of good 1 doubles, the price of good 2 becomes 8 times larger, 
and income becomes 4 times larger. Write down an equation for the new 
budget line in terms of the original prices and income. 

2. What happens to the budget line if the price of good 2 increases, but 
the price of good 1 and income remain constant? 

3. If the price of good 1 doubles and the price of good 2 triples, does the 
budget line become flatter or steeper? 

4. What is the definition of a numeraire good? 

5. Suppose that the government puts a tax of 15 cents a gallon on gasoline 
and then later decides to put a subsidy on gasoline at a rate of 7 cents a 
gallon. What net tax is this combination equivalent to? 

6. Suppose that a budget equation is given by p\X\ + P 2 X 2 = m. The 
government decides to impose a lump-sum tax of u, a quantity tax on 
good 1 of t, and a quantity subsidy on good 2 of s. What is the formula 
for the new budget line? 

7. If the income of the consumer increases and one of the prices decreases 
at the same time, will the consumer necessarily be at least as well-off? 


CHAPTER 



PREFERENCES 


We saw in Chapter 2 that the economic model of consumer behavior is very 
simple: people choose the best things they can afford. The last chapter was 
devoted to clarifying the meaning of “can afford,” and this chapter will be 
devoted to clarifying the economic concept of “best things.” 

We call the objects of consumer choice consumption bundles. This 
is a complete list of the goods and services that are involved in the choice 
problem that we are investigating. The word “complete” deserves empha¬ 
sis: when you analyze a consumer’s choice problem, make sure that you 
include all of the appropriate goods in the definition of the consumption 
bundle. 

If we are analyzing consumer choice at the broadest level, we would want 
not only a complete list of the goods that a consumer might consume, but 
also a description of when, where, and under what circumstances they 
would become available. After all, people care about how much food they 
will have tomorrow as well as how much food they have today. A raft in the 
middle of the Atlantic Ocean is very different from a raft in the middle of 
the Sahara Desert. And an umbrella when it is raining is quite a different 
good from an umbrella on a sunny day. It is often useful to think of the 
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“same” good available in different locations or circumstances as a different 
good, since the consumer may value the good differently in those situations. 

However, when we limit our attention to a simple choice problem, the 
relevant goods are usually pretty obvious. We’ll often adopt the idea de¬ 
scribed earlier of using just two goods and calling one of them “all other 
goods” so that we can focus on the tradeoff between one good and ev¬ 
erything else. In this way we can consider consumption choices involving 
many goods and still use two-dimensional diagrams. 

So let us take our consumption bundle to consist of two goods, and let 
*1 denote the amount of one good and *2 the amount of the other. The 
complete consumption bundle is therefore denoted by (*i,* 2 )■ As noted 
before, we will occasionally abbreviate this consumption bundle by X. 


3.1 Consumer Preferences 

We will suppose that given any two consumption bundles, (*i,* 2 ) and 
(j/i, j/ 2 ), the consumer can rank them as to their desirability. That is, the 
consumer can determine that one of the consumption bundles is strictly 
better than the other, or decide that she is indifferent between the two 
bundles. 

We will use the symbol >- to mean that one bundle is strictly preferred 
to another, so that (*i,* 2 ) >~ ( 2 / 1 , 2 / 2 ) should be interpreted as saying that 
the consumer strictly prefers (* 1 , 2 : 2 ) to ( 2 / 1 , 2 / 2 ), in the sense that she 
definitely wants the x-bundle rather than the y-bundle. This preference 
relation is meant to be an operational notion. If the consumer prefers 
one bundle to another, it means that he or she would choose one over the 
other, given the opportunity. Thus the idea of preference is based on the 
consumer’s behavior. In order to tell whether one bundle is preferred to 
another, we see how the consumer behaves in choice situations involving 
the two bundles. If she always chooses (* 1 , 2 : 2 ) when ( 2 / 1 , 2 / 2 ) is available, 
then it is natural to say that this consumer prefers (*i,* 2 ) to (z/i, 2 / 2 )- 

If the consumer is indifferent between two bundles of goods, we use 
the symbol ~ and write (*i,* 2 ) ~ ( 2/1 > 2 / 2 )- Indifference means that the 
consumer would be just as satisfied, according to her own preferences, 
consuming the bundle (* 1 , * 2 ) as she would be consuming the other bundle, 

(2/13 2/2) - 

If the consumer prefers or is indifferent between the two bundles we say 
that she weakly prefers (*i,* 2 ) to ( 221 , 2 / 2 ) and write (*i,* 2 ) cl ( 2 / 1 , 2 / 2 )- 

These relations of strict preference, weak preference, and indifference 
are not independent concepts; the relations are themselves related! For 
example, if (*i,* 2 ) cl ( 2/1 ? 2 / 2 ) and ( 2 / 1 , 222 ) cl ( 2 : 1 ,* 2 ) we can conclude that 
(* 1 , * 2 ) ~ ( 221 , 2 / 2 )- That is, if the consumer thinks that (*i,* 2 ) is at least 
as good as ( 2 / 1 , 222 ) and that ( 221 , 2 / 2 ) is at least as good as (cci, * 2 ), then the 
consumer must be indifferent between the two bundles of goods. 
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Similarly, if (xi,x 2 ) y ( 2 / 1 , 2 / 2 ) but we know that it is not the case that 
(xi,x 2 ) ~ (y 1 , 2 / 2 ), we can conclude that we must have (xi,x 2 ) >- ( 221 , 2 / 2 )- 
This just says that if the consumer thinks that (xi,x 2 ) is at least as good 
as ( 2 / 1 , 2 / 2 ), and she is not indifferent between the two bundles, then it must 
be that she thinks that (xi,x 2 ) is strictly better than ( 221 , 2 / 2 )- 


3.2 Assumptions about Preferences 

Economists usually make some assumptions about the “consistency” of 
consumers’ preferences. For example, it seems unreasonable—not to say 
contradictory—to have a situation where (xi,x 2 ) >~ ( 2 / 1 , 222 ) and, at the 
same time, ( 221 , 2 / 2 ) >- (xi,x 2 ). For this would mean that the consumer 
strictly prefers the x-bundle to the y-bundle ... and vice versa. 

So we usually make some assumptions about how the preference relations 
work. Some of the assumptions about preferences are so fundamental that 
we can refer to them as “axioms” of consumer theory. Here are three such 
axioms about consumer preference. 

Complete. We assume that any two bundles can be compared. That is, 
given any x-bundle and any y-bundle, we assume that (xi,x 2 ) h ( 221 , 2 / 2 ), 
or ( 221 , 2 / 2 ) y (xi,x 2 ), or both, in which case the consumer is indifferent 
between the two bundles. 

Reflexive. We assume that any bundle is at least as good as itself: 
(xi,x 2 ) h (xi,x 2 ). 


Transitive. If (xi,x 2 ) h ( 221 , 2 / 2 ) and ( 2 / 1 , 222 ) h (- 21 ,- 22 ), then we assume 
that (xi, x 2 ) y (z\, z 2 ). In other words, if the consumer thinks that X is at 
least as good as Y and that Y is at least as good as Z, then the consumer 
thinks that X is at least as good as Z. 

The first axiom, completeness, is hardly objectionable, at least for the 
kinds of choices economists generally examine. To say that any two bundles 
can be compared is simply to say that the consumer is able to make a choice 
between any two given bundles. One might imagine extreme situations 
involving life or death choices where ranking the alternatives might be 
difficult, or even impossible, but these choices are, for the most part, outside 
the domain of economic analysis. 

The second axiom, reflexivity, is trivial. Any bundle is certainly at least 
as good as an identical bundle. Parents of small children may occasionally 
observe behavior that violates this assumption, but it seems plausible for 
most adult behavior. 

The third axiom, transitivity, is more problematic. It isn’t clear that 
transitivity of preferences is necessarily a property that preferences would 
have to have. The assumption that preferences are transitive doesn’t seem 
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compelling on grounds of pure logic alone. In fact it’s not. Transitivity is 
a hypothesis about people’s choice behavior, not a statement of pure logic. 
Whether it is a basic fact of logic or not isn’t the point: it is whether or not 
it is a reasonably accurate description of how people behave that matters. 

What would you think about a person who said that he preferred a 
bundle X to Y, and preferred Y to Z , but then also said that he preferred 
Z to XI This would certainly be taken as evidence of peculiar behavior. 

More importantly, how would this consumer behave if faced with choices 
among the three bundles X, Y, and Z? If we asked him to choose his most 
preferred bundle, he would have quite a problem, for whatever bundle he 
chose, there would always be one that was preferred to it. If we are to have 
a theory where people are making “best” choices, preferences must satisfy 
the transitivity axiom or something very much like it. If preferences were 
not transitive there could well be a set of bundles for which there is no best 
choice. 


3.3 Indifference Curves 

It turns out that the whole theory of consumer choice can be formulated 
in terms of preferences that satisfy the three axioms described above, plus 
a few more technical assumptions. However, we will find it convenient to 
describe preferences graphically by using a construction known as indif¬ 
ference curves. 

Consider Figure 3.1 where we have illustrated two axes representing a 
consumer’s consumption of goods 1 and 2. Let us pick a certain consump¬ 
tion bundle {x 1 , 2 : 2 ) and shade in all of the consumption bundles that are 
weakly preferred to (xi, X 2 )• This is called the weakly preferred set. The 
bundles on the boundary of this set—the bundles for which the consumer 
is just indifferent to ( Xi,X 2 )—form the indifference curve. 

We can draw an indifference curve through any consumption bundle we 
want. The indifference curve through a consumption bundle consists of all 
bundles of goods that leave the consumer indifferent to the given bundle. 

One problem with using indifference curves to describe preferences is 
that they only show you the bundles that the consumer perceives as being 
indifferent to each other—they don’t show you which bundles are better 
and which bundles are worse. It is sometimes useful to draw small arrows 
on the indifference curves to indicate the direction of the preferred bundles. 
We won’t do this in every case, but we will do it in a few of the examples 
where confusion might arise. 

If we make no further assumptions about preferences, indifference curves 
can take very peculiar shapes indeed. But even at this level of generality, 
we can state an important principle about indifference curves: indifference 
curves representing distinct levels of preference cannot cross. That is, the 
situation depicted in Figure 3.2 cannot occur. 
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Weakly preferred set: 
bundles weakly 
preferred to (x 1; x 2 ) 


Indifference 
curve: 
bundles 
indifferent 
to (*,, x 2 ) 


*1 


*1 


Weakly preferred set. The shaded area consists of all bun¬ 
dles that are at least as good as the bundle 


Figure 

3.1 


In order to prove this, let us choose three bundles of goods, X, Y, and 
Z , such that X lies only on one indifference curve, Y lies only on the other 
indifference curve, and Z lies at the intersection of the indifference curves. 
By assumption the indifference curves represent distinct levels of prefer¬ 
ence, so one of the bundles, say X , is strictly preferred to the other bundle, 
Y. We know that X ~ Z and Z ~ Y, and the axiom of transitivity there¬ 
fore implies that X ~ Y. But this contradicts the assumption that X >- Y. 
This contradiction establishes the result—indifference curves representing 
distinct levels of preference cannot cross. 

What other properties do indifference curves have? In the abstract, the 
answer is: not many. Indifference curves are a way to describe preferences. 
Nearly any “reasonable” preferences that you can think of can be depicted 
by indifference curves. The trick is to learn what kinds of preferences give 
rise to what shapes of indifference curves. 


3.4 Examples of Preferences 

Let us try to relate preferences to indifference curves through some exam¬ 
ples. We’ll describe some preferences and then see what the indifference 
curves that represent them look like. 
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Figure 

3.2 


x 2 



Indifference curves cannot cross. If they did, X , Y, and 
Z would all have to be indifferent to each other and thus could 
not lie on distinct indifference curves. 


There is a general procedure for constructing indifference curves given 
a “verbal” description of the preferences. First plop your pencil down on 
the graph at some consumption bundle ( xi,x 2 )■ Now think about giving a 
little more of good 1, Ax±, to the consumer, moving him to ( x\ + Aaq, x 2 ). 
Now ask yourself how would you have to change the consumption of X 2 
to make the consumer indifferent to the original consumption point? Call 
this change Ax 2 . Ask yourself the question “For a given change in good 
1, how does good 2 have to change to make the consumer just indifferent 
between {x\ + Axi,x 2 + Ax 2 ) and (xi,x 2 )?” Once you have determined 
this movement at one consumption bundle you have drawn a piece of the 
indifference curve. Now try it at another bundle, and so on, until you 
develop a clear picture of the overall shape of the indifference curves. 


Perfect Substitutes 

Two goods are perfect substitutes if the consumer is willing to substitute 
one good for the other at a constant rate. The simplest case of perfect 
substitutes occurs when the consumer is willing to substitute the goods on 
a one-to-one basis. 

Suppose, for example, that we are considering a choice between red pen¬ 
cils and blue pencils, and the consumer involved likes pencils, but doesn’t 
care about color at all. Pick a consumption bundle, say (10,10). Then for 
this consumer, any other consumption bundle that has 20 pencils in it is 
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just as good as (10,10). Mathematically speaking, any consumption bun¬ 
dle ( Xi,X 2 ) such that X\ + *2 = 20 will be on this consumer’s indifference 
curve through (10,10). Thus the indifference curves for this consumer are 
all parallel straight lines with a slope of —1, as depicted in Figure 3.3. 
Bundles with more total pencils are preferred to bundles with fewer total 
pencils, so the direction of increasing preference is up and to the right, as 
illustrated in Figure 3.3. 

How does this work in terms of general procedure for drawing indifference 
curves? If we are at (10,10), and we increase the amount of the first good 
by one unit to 11, how much do we have to change the second good to get 
back to the original indifference curve? The answer is clearly that we have 
to decrease the second good by 1 unit. Thus the indifference curve through 
(10,10) has a slope of —1. The same procedure can be carried out at any 
bundle of goods with the same results—in this case all the indifference 
curves have a constant slope of —1. 


*2 



Perfect substitutes. The consumer only cares about the total 
number of pencils, not about their colors. Thus the indifference 
curves are straight lines with a slope of —1. 


Figure 

3.3 


The important fact about perfect substitutes is that the indifference 
curves have a constant slope. Suppose, for example, that we graphed blue 
pencils on the vertical axis and pairs of red pencils on the horizontal axis. 
The indifference curves for these two goods would have a slope of —2, since 
the consumer would be willing to give up two blue pencils to get one more 
pair of red pencils. 
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In the textbook we’ll primarily consider the case where goods are perfect 
substitutes on a one-for-one basis, and leave the treatment of the general 
case for the workbook. 


Perfect Complements 

Perfect complements are goods that are always consumed together in 
fixed proportions. In some sense the goods “complement” each other. A 
nice example is that of right shoes and left shoes. The consumer likes shoes, 
but always wears right and left shoes together. Having only one out of a 
pair of shoes doesn’t do the consumer a bit of good. 

Let us draw the indifference curves for perfect complements. Suppose 
we pick the consumption bundle (10,10). Now add 1 more right shoe, so 
we have (11,10). By assumption this leaves the consumer indifferent to 
the original position: the extra shoe doesn’t do him any good. The same 
thing happens if we add one more left shoe: the consumer is also indifferent 
between (10,11) and (10,10). 

Thus the indifference curves are L-shaped, with the vertex of the L oc¬ 
curring where the number of left shoes equals the number of right shoes as 
in Figure 3.4. 


LEFT SHOES 




Indifference 

curves 



RIGHT SHOES 


Figure 

3.4 


Perfect complements. The consumer always wants to con¬ 
sume the goods in fixed proportions to each other. Thus the 
indifference curves are L-shaped. 
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Increasing both the number of left shoes and the number of right shoes 
at the same time will move the consumer to a more preferred position, 
so the direction of increasing preference is again up and to the right, as 
illustrated in the diagram. 

The important thing about perfect complements is that the consumer 
prefers to consume the goods in fixed proportions, not necessarily that 
the proportion is one-to-one. If a consumer always uses two teaspoons of 
sugar in her cup of tea, and doesn’t use sugar for anything else, then the 
indifference curves will still be L-shaped. In this case the corners of the 
L will occur at (2 teaspoons sugar, 1 cup tea), (4 teaspoons sugar, 2 cups 
tea) and so on, rather than at (1 right shoe, 1 left shoe), (2 right shoes, 2 
left shoes), and so on. 

In the textbook we’ll primarily consider the case where the goods are 
consumed in proportions of one-for-one and leave the treatment of the 
general case for the workbook. 


Bads 

A bad is a commodity that the consumer doesn’t like. For example, sup¬ 
pose that the commodities in question are now pepperoni and anchovies— 
and the consumer loves pepperoni but dislikes anchovies. But let us suppose 
there is some possible tradeoff between pepperoni and anchovies. That is, 
there would be some amount of pepperoni on a pizza that would compen¬ 
sate the consumer for having to consume a given amount of anchovies. How 
could we represent these preferences using indifference curves? 

Pick a bundle (xi, X 2 ) consisting of some pepperoni and some anchovies. 
If we give the consumer more anchovies, what do we have to do with the 
pepperoni to keep him on the same indifference curve? Clearly, we have 
to give him some extra pepperoni to compensate him for having to put up 
with the anchovies. Thus this consumer must have indifference curves that 
slope up and to the right as depicted in Figure 3.5. 

The direction of increasing preference is down and to the right—that 
is, toward the direction of decreased anchovy consumption and increased 
pepperoni consumption, just as the arrows in the diagram illustrate. 


Neutrals 

A good is a neutral good if the consumer doesn’t care about it one way 
or the other. What if a consumer is just neutral about anchovies? 1 In this 
case his indifference curves will be vertical lines as depicted in Figure 3.6. 


1 


Is anybody neutral about anchovies? 
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Figure 

3.5 



Bads. Here anchovies are a “bad,” and pepperoni is a “good” 
for this consumer. Thus the indifference curves have a positive 
slope. 


Figure 

3.6 



A neutral good. The consumer likes pepperoni but is neutral 
about anchovies, so the indifference curves are vertical lines. 


He only cares about the amount of pepperoni he has and doesn’t care at 
all about how many anchovies he has. The more pepperoni the better, but 
adding more anchovies doesn’t affect him one way or the other. 
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Satiation 

We sometimes want to consider a situation involving satiation, where 
there is some overall best bundle for the consumer, and the “closer” he is 
to that best bundle, the better off he is in terms of his own preferences. 
For example, suppose that the consumer has some most preferred bundle 
of goods (xi,X 2 ), and the farther away he is from that bundle, the worse 
off he is. In this case we say that (xi,xz) is a satiation point, or a bliss 
point. The indifference curves for the consumer look like those depicted in 
Figure 3.7. The best point is (ah, $ 2 ) and points farther away from this 
bliss point lie on “lower” indifference curves. 



Satiated preferences. The bundle (ah, £ 2 ) is the satiation 
point or bliss point, and the indifference curves surround this 
point. 


In this case the indifference curves have a negative slope when the con¬ 
sumer has “too little” or “too much” of both goods, and a positive slope 
when he has “too much” of one of the goods. When he has too much of one 
of the goods, it becomes a bad—reducing the consumption of the bad good 
moves him closer to his “bliss point.” If he has too much of both goods, 
they both are bads, so reducing the consumption of each moves him closer 
to the bliss point. 

Suppose, for example, that the two goods are chocolate cake and ice 
cream. There might well be some optimal amount of chocolate cake and 


Figure 

3.7 
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ice cream that you would want to eat per week. Any less than that amount 
would make you worse off, but any more than that amount would also make 
you worse off. 

If you think about it, most goods are like chocolate cake and ice cream 
in this respect—you can have too much of nearly anything. But people 
would generally not voluntarily choose to have too much of the goods they 
consume. Why would you choose to have more than you want of something? 
Thus the interesting region from the viewpoint of economic choice is where 
you have less than you want of most goods. The choices that people actually 
care about are choices of this sort, and these are the choices with which we 
will be concerned. 


Discrete Goods 

Usually we think of measuring goods in units where fractional amounts 
make sense—you might on average consume 12.43 gallons of milk a month 
even though you buy it a quart at a time. But sometimes we want to 
examine preferences over goods that naturally come in discrete units. 

For example, consider a consumer’s demand for automobiles. We could 
define the demand for automobiles in terms of the time spent using an 
automobile, so that we would have a continuous variable, but for many 
purposes it is the actual number of cars demanded that is of interest. 

There is no difficulty in using preferences to describe choice behavior 
for this kind of discrete good. Suppose that X 2 is money to be spent on 
other goods and X\ is a discrete good that is only available in integer 
amounts. We have illustrated the appearance of indifference “curves” and 
a weakly preferred set for this kind of good in Figure 3.8. In this case the 
bundles indifferent to a given bundle will be a set of discrete points. The 
set of bundles at least as good as a particular bundle will be a set of line 
segments. 

The choice of whether to emphasize the discrete nature of a good or not 
will depend on our application. If the consumer chooses only one or two 
units of the good during the time period of our analysis, recognizing the 
discrete nature of the choice may be important. But if the consumer is 
choosing 30 or 40 units of the good, then it will probably be convenient to 
think of this as a continuous good. 


3.5 Well-Behaved Preferences 

We’ve now seen some examples of indifference curves. As we’ve seen, many 
kinds of preferences, reasonable or unreasonable, can be described by these 
simple diagrams. But if we want to describe preferences in general, it will 
be convenient to focus on a few general shapes of indifference curves. In 
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A discrete good. Here good 1 is only available in integer 
amounts. In panel A the dashed lines connect together the 
bundles that are indifferent, and in panel B the vertical lines 
represent bundles that are at least as good as the indicated 
bundle. 


this section we will describe some more general assumptions that we will 
typically make about preferences and the implications of these assumptions 
for the shapes of the associated indifference curves. These assumptions 
are not the only possible ones; in some situations you might want to use 
different assumptions. But we will take them as the defining features for 
well-behaved indifference curves. 

First we will typically assume that more is better, that is, that we are 
talking about goods , not bads. More precisely, if (x\,X 2 ) is a bundle of 
goods and (z/i, 1 / 2 ) is a bundle of goods with at least as much of both goods 
and more of one, then ( 2 / 1 , 2 / 2 ) >- ( Xi,X 2 )■ This assumption is sometimes 
called mo notonicity of preferences. As we suggested in our discussion of 
satiation, more is better would probably only hold up to a point. Thus 
the assumption of monotonicity is saying only that we are going to ex¬ 
amine situations before that point is reached- before any satiation sets 
in—while more still is better. Economics would not be a very interesting 
subject in a world where everyone was satiated in their consumption of 
every good. 

What does monotonicity imply about the shape of indifference curves? 
It implies that they have a negative slope. Consider Figure 3.9. If we start 
at a bundle (x\,X 2 ) and move anywhere up and to the right, we must be 
moving to a preferred position. If we move down and to the left we must be 
moving to a worse position. So if we are moving to an indifferent position, 
we must be moving either left and up or right and down: the indifference 
curve must have a negative slope. 


Figure 

3.8 
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Figure 

3.9 


Second, we are going to assume that averages are preferred to extremes. 
That is, if we take two bundles of goods (xi,x 2 ) and ( yi,y 2 ) on the same 
indifference curve and take a weighted average of the two bundles such as 

(1 11 1 \ 

^x 1 + - y 1 ,- x 2 + 2 y 2 ), 

then the average bundle will be at least as good as or strictly preferred 
to each of the two extreme bundles. This weighted-average bundle has 
the average amount of good 1 and the average amount of good 2 that is 
present in the two bundles. It therefore lies halfway along the straight line 
connecting the x-bundle and the y- bundle. 



Monotonic preferences. More of both goods is a better 
bundle for this consumer; less of both goods represents a worse 
bundle. 


Actually, we’re going to assume this for any weight t between 0 and 1, 
not just 1/2. Thus we are assuming that if (xi,x 2 ) ~ (yi,Z/ 2 ), then 

(tei + (1 - t)y 1 ,tx 2 + (1 - t)y 2 ) h. (x 1 ,x 2 ) 

for any t such that 0 < t < 1. This weighted average of the two bundles 
gives a weight of t to the x-bundle and a weight of 1 — t to the y-bundle. 
Therefore, the distance from the x-bundle to the average bundle is just 
a fraction t of the distance from the x-bundle to the y-bundle, along the 
straight line connecting the two bundles. 
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What does this assumption about preferences mean geometrically? It 
means that the set of bundles weakly preferred to (xi, X 2 ) is a convex set. 
For suppose that (yi, j/ 2 ) and (xi, X 2 ) are indifferent bundles. Then, if aver¬ 
ages are preferred to extremes, all of the weighted averages of (xi,X 2 ) and 
(j/i, j/ 2 ) are weakly preferred to (xi,X 2 ) and ( 3 / 1 , 2 / 2 )- A convex set has the 
property that if you take any two points in the set and draw the line seg¬ 
ment connecting those two points, that line segment lies entirely in the set. 

Figure 3.10A depicts an example of convex preferences, while Figures 
3.10B and 3.IOC show two examples of nonconvex preferences. Figure 
3.10C presents preferences that are so nonconvex that we might want to 
call them “concave preferences.” 
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A Convex 

B Nonconvex 

C Concave 


preferences 

preferences 

preferences 

Various kinds of preferences. Panel A depicts convex pref¬ 
erences, panel B depicts nonconvex preferences, and panel C 

depicts “concave” preferences. 



Figure 

3.10 


Can you think of preferences that are not convex? One possibility might 
be something like my preferences for ice cream and olives. I like ice cream 
and I like olives ... but I don’t like to have them together! In considering 
my consumption in the next hour, I might be indifferent between consuming 
8 ounces of ice cream and 2 ounces of olives, or 8 ounces of olives and 2 
ounces of ice cream. But either one of these bundles would be better than 
consuming 5 ounces of each! These are the kind of preferences depicted in 
Figure 3.IOC. 

Why do we want to assume that well-behaved preferences are convex? 
Because, for the most part, goods are consumed together. The kinds 
of preferences depicted in Figures 3.10B and 3.IOC imply that the con- 
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sumer would prefer to specialize, at least to some degree, and to consume 
only one of the goods. However, the normal case is where the consumer 
would want to trade some of one good for the other and end up consuming 
some of each, rather than specializing in consuming only one of the two 
goods. 

In fact, if we look at my preferences for monthly consumption of ice 
cream and olives, rather than at my immediate consumption, they would 
tend to look much more like Figure 3.10A than Figure 3.IOC. Each month 
I would prefer having some ice cream and some olives—albeit at different 
times—to specializing in consuming either one for the entire month. 

Finally, one extension of the assumption of convexity is the assumption 
of strict convexity. This means that the weighted average of two in¬ 
different bundles is strictly preferred to the two extreme bundles. Convex 
preferences may have flat spots, while strictly convex preferences must have 
indifferences curves that are “rounded.” The preferences for two goods that 
are perfect substitutes are convex, but not strictly convex. 


3.6 The Marginal Rate of Substitution 

We will often find it useful to refer to the slope of an indifference curve at 
a particular point. This idea is so useful that it even has a name: the slope 
of an indifference curve is known as the marginal rate of substitution 
(MRS). The name comes from the fact that the MRS measures the rate 
at which the consumer is just willing to substitute one good for the other. 

Suppose that we take a little of good 1, A.Ti, away from the consumer. 
Then we give him Ax 2 , an amount that is just sufficient to put him back 
on his indifference curve, so that he is just as well off after this substitution 
of x 2 for x\ as he was before. We think of the ratio Ax^j Ax\ as being the 
rate at which the consumer is willing to substitute good 2 for good 1. 

Now think of Ax\ as being a very small change—a marginal change. 
Then the rate Ax^l Ax\ measures the marginal rate of substitution of good 
2 for good 1. As Ax\ gets smaller, Ax^/Ax\ approaches the slope of the 
indifference curve, as can be seen in Figure 3.11. 

When we write the ratio Aa^/Aa?!, we will always think of both the 
numerator and the denominator as being small numbers—as describing 
marginal changes from the original consumption bundle. Thus the ratio 
defining the MRS will always describe the slope of the indifference curve: 
the rate at which the consumer is just willing to substitute a little more 
consumption of good 2 for a little less consumption of good 1. 

One slightly confusing thing about the MRS is that it is typically a 
negative number. We’ve already seen that monotonic preferences imply 
that indifference curves must have a negative slope. Since the MRS is the 
numerical measure of the slope of an indifference curve, it will naturally be 
a negative number. 
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The marginal rate of substitution (MRS). The marginal 
rate of substitution measures the slope of the indifference curve. 


The marginal rate of substitution measures an interesting aspect of the 
consumer’s behavior. Suppose that the consumer has well-behaved prefer¬ 
ences, that is, preferences that are monotonic and convex, and that he is 
currently consuming some bundle (xi,^)- We now will offer him a trade: 
he can exchange good 1 for 2, or good 2 for 1, in any amount at a “rate of 
exchange” of E. 

That is, if the consumer gives up Ax\ units of good 1, he can get EAx\ 
units of good 2 in exchange. Or, conversely, if he gives up Ax 2 units of good 
2, he can get Ax^/E units of good 1. Geometrically, we are offering the 
consumer an opportunity to move to any point along a line with slope —E 
that passes through (xi,X 2 ), as depicted in Figure 3.12. Moving up and to 
the left from (xi,X 2 ) involves exchanging good 1 for good 2, and moving 
down and to the right involves exchanging good 2 for good 1. In either 
movement, the exchange rate is E. Since exchange always involves giving 
up one good in exchange for another, the exchange rate E corresponds to 
a slope of —E. 

We can now ask what would the rate of exchange have to be in order for 
the consumer to want to stay put at (xi,X 2 )? To answer this question, we 
simply note that any time the exchange line crosses the indifference curve, 
there will be some points on that line that are preferred to (xi,X 2 )—that 
lie above the indifference curve. Thus, if there is to be no movement from 


Figure 

3.11 








50 PREFERENCES (Ch. 3) 


Figure 

3.12 


(xi, X 2 ), the exchange line must be tangent to the indifference curve. That 
is, the slope of the exchange line, —E, must be the slope of the indifference 
curve at (xi,X 2 )- At any other rate of exchange, the exchange line would 
cut the indifference curve and thus allow the consumer to move to a more 
preferred point. 



Trading at an exchange rate. Here we are allowing the con¬ 
sumer to trade the goods at an exchange rate E, which implies 
that the consumer can move along a line with slope —E. 


Thus the slope of the indifference curve, the marginal rate of substitution, 
measures the rate at which the consumer is just on the margin of trading 
or not trading. At any rate of exchange other than the MRS, the consumer 
would want to trade one good for the other. But if the rate of exchange 
equals the MRS, the consumer wants to stay put. 


3.7 Other Interpretations of the MRS 

We have said that the MRS measures the rate at which the consumer is 
just on the margin of being willing to substitute good 1 for good 2. We 
could also say that the consumer is just on the margin of being willing to 
“pay” some of good 1 in order to buy some more of good 2. So sometimes 
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you hear people say that the slope of the indifference curve measures the 

marginal willingness to pay. 

If good 2 represents the consumption of “all other goods,” and it is 
measured in dollars that you can spend on other goods, then the marginal- 
willingness-to-pay interpretation is very natural. The marginal rate of sub¬ 
stitution of good 2 for good 1 is how many dollars you would just be willing 
to give up spending on other goods in order to consume a little bit more 
of good 1. Thus the MRS measures the marginal willingness to give up 
dollars in order to consume a small amount more of good 1. But giving up 
those dollars is just like paying dollars in order to consume a little more of 
good 1. 

If you use the marginal-willingness-to-pay interpretation of the MRS, you 
should be careful to emphasize both the “marginal” and the “willingness” 
aspects. The MRS measures the amount of good 2 that one is willing to 
pay for a marginal amount of extra consumption of good 1. How much 
you actually have to pay for some given amount of extra consumption may 
be different than the amount you are willing to pay. How much you have 
to pay will depend on the price of the good in question. How much you 
are willing to pay doesn’t depend on the price—it is determined by your 
preferences. 

Similarly, how much you may be willing to pay for a large change in 
consumption may be different from how much you are willing to pay for 
a marginal change. How much you actually end up buying of a good will 
depend on your preferences for that good and the prices that you face. How 
much you would be willing to pay for a small amount extra of the good is 
a feature only of your preferences. 


3.8 Behavior of the MRS 

It is sometimes useful to describe the shapes of indifference curves by de¬ 
scribing the behavior of the marginal rate of substitution. For example, 
the “perfect substitutes” indifference curves are characterized by the fact 
that the MRS is constant at —1. The “neutrals” case is characterized by 
the fact that the MRS is everywhere infinite. The preferences for “perfect 
complements” are characterized by the fact that the MRS is either zero or 
infinity, and nothing in between. 

We’ve already pointed out that the assumption of monotonicity implies 
that indifference curves must have a negative slope, so the MRS always 
involves reducing the consumption of one good in order to get more of 
another for monotonic preferences. 

The case of convex indifference curves exhibits yet another kind of be¬ 
havior for the MRS. For strictly convex indifference curves, the MRS—the 
slope of the indifference curve—decreases (in absolute value) as we increase 
x\. Thus the indifference curves exhibit a diminishing marginal rate of 
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substitution. This means that the amount of good 1 that the person is 
willing to give up for an additional amount of good 2 increases the amount 
of good 1 increases. Stated in this way, convexity of indifference curves 
seems very natural: it says that the more you have of one good, the more 
willing you are to give some of it up in exchange for the other good. (But 
remember the ice cream and olives example—for some pairs of goods this 
assumption might not hold!) 


Summary 

1. Economists assume that a consumer can rank various consumption pos¬ 
sibilities. The way in which the consumer ranks the consumption bundles 
describes the consumer’s preferences. 

2. Indifference curves can be used to depict different kinds of preferences. 

3. Well-behaved preferences are monotonic (meaning more is better) and 
convex (meaning averages are preferred to extremes). 

4. The marginal rate of substitution (MRS) measures the slope of the in¬ 
difference curve. This can be interpreted as how much the consumer is 
willing to give up of good 2 to acquire more of good 1. 


REVIEW QUESTIONS 


1. If we observe a consumer choosing ( x±,X 2 ) when (z/i, 7 / 2 ) is available one 
time, are we justified in concluding that (aq,^) >~ (yi, 2 / 2 )? 

2. Consider a group of people A, B, C and the relation “at least as tall as,” 
as in “A is at least as tall as B.” Is this relation transitive? Is it complete? 

3. Take the same group of people and consider the relation “strictly taller 
than.” Is this relation transitive? Is it reflexive? Is it complete? 

4. A college football coach says that given any two linemen A and B, he 
always prefers the one who is bigger and faster. Is this preference relation 
transitive? Is it complete? 

5. Can an indifference curve cross itself? For example, could Figure 3.2 
depict a single indifference curve? 

6. Could Figure 3.2 be a single indifference curve if preferences are mono¬ 
tonic? 
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7. If both pepperoni and anchovies are bads, will the indifference curve 
have a positive or a negative slope? 

8 . Explain why convex preferences means that “averages are preferred to 
extremes.” 

9. What is your marginal rate of substitution of $1 bills for $5 bills? 

10. If good 1 is a “neutral,” what is its marginal rate of substitution for 
good 2? 

11. Think of some other goods for which your preferences might be concave. 


CHAPTER 



UTILITY 


In Victorian days, philosophers and economists talked blithely of “utility” 
as an indicator of a person’s overall well-being. Utility was thought of as 
a numeric measure of a person’s happiness. Given this idea, it was natural 
to think of consumers making choices so as to maximize their utility, that 
is, to make themselves as happy as possible. 

The trouble is that these classical economists never really described how 
we were to measure utility. How are we supposed to quantify the “amount” 
of utility associated with different choices? Is one person’s utility the same 
as another’s? What would it mean to say that an extra candy bar would 
give me twice as much utility as an extra carrot? Does the concept of utility 
have any independent meaning other than its being what people maximize? 

Because of these conceptual problems, economists have abandoned the 
old-fashioned view of utility as being a measure of happiness. Instead, 
the theory of consumer behavior has been reformulated entirely in terms 
of consumer preferences, and utility is seen only as a way to describe 
preferences. 

Economists gradually came to recognize that all that mattered about 
utility as far as choice behavior was concerned was whether one bundle 
had a higher utility than another—how much higher didn’t really matter. 
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Originally, preferences were defined in terms of utility: to say a bundle 
(xi,X 2 ) was preferred to a bundle ( 2/1,2/2 ) meant that the x-bundle had a 
higher utility than the y-bundle. But now we tend to think of things the 
other way around. The preferences of the consumer are the fundamen¬ 
tal description useful for analyzing choice, and utility is simply a way of 
describing preferences. 

A utility function is a way of assigning a number to every possible 
consumption bundle such that more-preferred bundles get assigned larger 
numbers than less-preferred bundles. That is, a bundle (xi, X 2 ) is preferred 
to a bundle (2/1, y 2 ) if and only if the utility of (xi,X2) is larger than the 
utility of (yi, 2/2): in symbols, (xi,X2) >~ (221,2/2) if and only if u(x 1,0:2) > 
u { yi , y 2 )- 

The only property of a utility assignment that is important is how it 
orders the bundles of goods. The magnitude of the utility function is only 
important insofar as it ranks the different consumption bundles; the size of 
the utility difference between any two consumption bundles doesn’t matter. 
Because of this emphasis on ordering bundles of goods, this kind of utility 
is referred to as ordinal utility. 

Consider for example Table 4.1, where we have illustrated several dif¬ 
ferent ways of assigning utilities to three bundles of goods, all of which 
order the bundles in the same way. In this example, the consumer prefers 
A to B and B to C. All of the ways indicated are valid utility functions 
that describe the same preferences because they all have the property that 
A is assigned a higher number than B, which in turn is assigned a higher 
number than C. 


Different ways to assign utilities. Table 

4.1 


Bundle 

Ui 

u 2 

u 3 

A 

3 

17 

-1 

B 

2 

10 

-2 

C 

1 

.002 

—3 


Since only the ranking of the bundles matters, there can be no unique 
way to assign utilities to bundles of goods. If we can find one way to assign 
utility numbers to bundles of goods, we can find an infinite number of 
ways to do it. If u(x\,X 2 ) represents a way to assign utility numbers to 
the bundles (xi,X 2 ), then multiplying u(xi,X 2 ) by 2 (or any other positive 
number) is just as good a way to assign utilities. 

Multiplication by 2 is an example of a monotonic transformation. A 
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Figure 

4.1 


monotonic transformation is a way of transforming one set of numbers into 
another set of numbers in a way that preserves the order of the numbers. 

We typically represent a monotonic transformation by a function /(it) 
that transforms each number u into some other number f(u), in a way 
that preserves the order of the numbers in the sense that u± > U 2 implies 
/(iti) > f(u 2 ). A monotonic transformation and a monotonic function are 
essentially the same thing. 

Examples of monotonic transformations are multiplication by a positive 
number (e.g., f(u) = 3 u), adding any number (e.g., /(it) = it+ 17), raising 
u to an odd power (e.g., f(u) = it 3 ), and so on. 1 

The rate of change of /(it) as it changes can be measured by looking at 
the change in / between two values of it, divided by the change in it: 

A/ = f(u 2 ) - /(m) 

A U U2 — Ui 

For a monotonic transformation, f(u2) — f{u i) always has the same sign as 
U 2 — U\. Thus a monotonic function always has a positive rate of change. 
This means that the graph of a monotonic function will always have a 
positive slope, as depicted in Figure 4.1 A. 


v/ 

V 



v=f(u) 

/v=f(u) 


u 

A 

u 

B 

A positive monotonic transformation. Panel A illustrates 
a monotonic function—one that is always increasing. Panel B 
illustrates a function that is not monotonic, since it sometimes 
increases and sometimes decreases. 


1 What we are calling a “monotonic transformation” is, strictly speaking, called a “posi¬ 
tive monotonic transformation,” in order to distinguish it from a “negative monotonic 
transformation,” which is one that reverses the order of the numbers. Monotonic 
transformations are sometimes called “monotonous transformations,” which seems 
unfair, since they can actually be quite interesting. 
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If f(u) is any monotonic transformation of a utility function that repre¬ 
sents some particular preferences, then f(u(x i, x 2 )) is also a utility function 
that represents those same preferences. 

Why? The argument is given in the following three statements: 

1. To say that u(x 1 , 2 : 2 ) represents some particular preferences means that 
u(x 1 ,x 2 ) > u(y ll y 2 ) if and only if (x lt x 2 ) >- ( 2 / 1 , ^ 2 )- 

2. But if f(u) is a monotonic transformation, then u(x±,x 2 ) > u(yi,y 2 ) if 
and only if f(u(xi,x 2 )) > f(u(y 1; y 2 )). 

3. Therefore, f(u(x u x 2 )) > f(u(y 1 ,y 2 )) if and only if (x!,x 2 ) >- (yi,y 2 ), 
so the function f(u) represents the preferences in the same way as the 
original utility function u(xi,x 2 ). 

We summarize this discussion by stating the following principle: a mono- 
tonic transformation of a utility function is a utility function that represents 
the same preferences as the original utility function. 

Geometrically, a utility function is a way to label indifference curves. 
Since every bundle on an indifference curve must have the same utility, a 
utility function is a way of assigning numbers to the different indifference 
curves in a way that higher indifference curves get assigned larger num¬ 
bers. Seen from this point of view a monotonic transformation is just a 
relabeling of indifference curves. As long as indifference curves containing 
more-preferred bundles get a larger label than indifference curves contain¬ 
ing less-preferred bundles, the labeling will represent the same preferences. 


4.1 Cardinal Utility 

There are some theories of utility that attach a significance to the magni¬ 
tude of utility. These are known as cardinal utility theories. In a theory 
of cardinal utility, the size of the utility difference between two bundles of 
goods is supposed to have some sort of significance. 

We know how to tell whether a given person prefers one bundle of goods 
to another: we simply offer him or her a choice between the two bundles 
and see which one is chosen. Thus we know how to assign an ordinal utility 
to the two bundles of goods: we just assign a higher utility to the chosen 
bundle than to the rejected bundle. Any assignment that does this will be 
a utility function. Thus we have an operational criterion for determining 
whether one bundle has a higher utility than another bundle for some 
individual. 

But how do we tell if a person likes one bundle twice as much as another? 
How could you even tell if you like one bundle twice as much as another? 

One could propose various definitions for this kind of assignment: I like 
one bundle twice as much as another if I am willing to pay twice as much 
for it. Or, I like one bundle twice as much as another if I am willing to run 
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twice as far to get it, or to wait twice as long, or to gamble for it at twice 
the odds. 

There is nothing wrong with any of these definitions; each one would 
give rise to a way of assigning utility levels in which the magnitude of the 
numbers assigned had some operational significance. But there isn’t much 
right about them either. Although each of them is a possible interpretation 
of what it means to want one thing twice as much as another, none of them 
appears to be an especially compelling interpretation of that statement. 

Even if we did find a way of assigning utility magnitudes that seemed 
to be especially compelling, what good would it do us in describing choice 
behavior? To tell whether one bundle or another will be chosen, we only 
have to know which is preferred—which has the larger utility. Knowing 
how much larger doesn’t add anything to our description of choice. Since 
cardinal utility isn’t needed to describe choice behavior and there is no 
compelling way to assign cardinal utilities anyway, we will stick with a 
purely ordinal utility framework. 


4.2 Constructing a Utility Function 

But are we assured that there is any way to assign ordinal utilities? Given 
a preference ordering can we always find a utility function that will order 
bundles of goods in the same way as those preferences? Is there a utility 
function that describes any reasonable preference ordering? 

Not all kinds of preferences can be represented by a utility function. 
For example, suppose that someone had intransitive preferences so that 
A >- B >~ C >- A. Then a utility function for these preferences would have 
to consist of numbers u(A), u(B ), and u{C) such that u(A) > u(B ) > 
u(C) > u(A). But this is impossible. 

However, if we rule out perverse cases like intransitive preferences, it 
turns out that we will typically be able to find a utility function to represent 
preferences. We will illustrate one construction here, and another one in 
Chapter 14. 

Suppose that we are given an indifference map as in Figure 4.2. We know 
that a utility function is a way to label the indifference curves such that 
higher indifference curves get larger numbers. How can we do this? 

One easy way is to draw the diagonal line illustrated and label each 
indifference curve with its distance from the origin measured along the 
line. 

How do we know that this is a utility function? It is not hard to see that 
if preferences are monotonic then the line through the origin must intersect 
every indifference curve exactly once. Thus every bundle is getting a label, 
and those bundles on higher indifference curves are getting larger labels— 
and that’s all it takes to be a utility function. 
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Constructing a utility function from indifference curves. 

Draw a diagonal line and label each indifference curve with how 
far it is from the origin measured along the line. 


This gives us one way to find a labeling of indifference curves, at least as 
long as preferences are monotonic. This won’t always be the most natural 
way in any given case, but at least it shows that the idea of an ordinal utility 
function is pretty general: nearly any kind of “reasonable” preferences can 
be represented by a utility function. 


4.3 Some Examples of Utility Functions 

In Chapter 3 we described some examples of preferences and the indiffer¬ 
ence curves that represented them. We can also represent these preferences 
by utility functions. If you are given a utility function, u(x 1 , 2 : 2 ), it is rel¬ 
atively easy to draw the indifference curves: you just plot all the points 
(xi,X 2 ) such that u(xi,X 2 ) equals a constant. In mathematics, the set of 
all (xi,X 2 ) such that u(x 1 , 0 : 2 ) equals a constant is called a level set. For 
each different value of the constant, you get a different indifference curve. 


EXAMPLE: Indifference Curves from Utility 

Suppose that the utility function is given by: u{X\,X 2 ) = X\X 2 - What do 
the indifference curves look like? 


Figure 

4.2 
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Figure 

4.3 


We know that a typical indifference curve is just the set of all X\ and x 2 
such that k = X\X 2 for some constant k. Solving for x 2 as a function of X\, 
we see that a typical indifference curve has the formula: 

k 

x 2 = —• 

Xi 


This curve is depicted in Figure 4.3 for k = 1, 2, 3 • • •. 



Indifference curves. The indifference curves k = X\X 2 for 
different values of k. 


Let’s consider another example. Suppose that we were given a utility 
function v{x\,x 2 ) = x\x 2 . What do its indifference curves look like? By 
the standard rules of algebra we know that: 

v(xi, x 2 ) = x\x 2 = {xix 2 ) 2 = u(x i,x 2 ) 2 . 

Thus the utility function v(xi,x 2 ) is just the square of the utility func¬ 
tion u(xi,x 2 ). Since u(xi,x 2 ) cannot be negative, it follows that v{x\ 1 x 2 ) 
is a monotonic transformation of the previous utility function, u(xi,x 2 ). 
This means that the utility function v(xi,x 2 ) = x\x\ has to have exactly 
the same shaped indifference curves as those depicted in Figure 4.3. The 
labeling of the indifference curves will be different—the labels that were 
1, 2, 3, • • • will now be 1,4,9, • • • but the set of bundles that has v(x\, x 2 ) = 
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9 is exactly the same as the set of bundles that has u(x i,x 2 ) = 3. Thus 
v(xi,X 2 ) describes exactly the same preferences as u(x i,x 2 ) since it orders 
all of the bundles in the same way. 

Going the other direction—finding a utility function that represents some 
indifference curves- is somewhat more difficult. There are two ways to 
proceed. The first way is mathematical. Given the indifference curves, we 
want to find a function that is constant along each indifference curve and 
that assigns higher values to higher indifference curves. 

The second way is a bit more intuitive. Given a description of the pref¬ 
erences, we try to think about what the consumer is trying to maximize— 
what combination of the goods describes the choice behavior of the con¬ 
sumer. This may seem a little vague at the moment, but it will be more 
meaningful after we discuss a few examples. 


Perfect Substitutes 

Remember the red pencil and blue pencil example? All that mattered to 
the consumer was the total number of pencils. Thus it is natural to measure 
utility by the total number of pencils. Therefore we provisionally pick the 
utility function u(x\ , X 2 ) = Xi+a^- Does this work? Just ask two things: is 
this utility function constant along the indifference curves? Does it assign 
a higher label to more-preferred bundles? The answer to both questions is 
yes, so we have a utility function. 

Of course, this isn’t the only utility function that we could use. We could 
also use the square of the number of pencils. Thus the utility function 
v(xi,X 2 ) = ( X\ + X 2) 2 = x\ + 2xix 2 + x| will also represent the perfect- 
substitutes preferences, as would any other monotonic transformation of 
u{x i,x 2 ). 

What if the consumer is willing to substitute good 1 for good 2 at a rate 
that is different from one-to-one? Suppose, for example, that the consumer 
would require two units of good 2 to compensate him for giving up one unit 
of good 1. This means that good 1 is twice as valuable to the consumer as 
good 2. The utility function therefore takes the form u(xi, x 2 ) = 2xi + x 2 . 
Note that this utility yields indifference curves with a slope of —2. 

In general, preferences for perfect substitutes can be represented by a 
utility function of the form 


u(xi, x 2 ) = axi + bx 2 - 

Here a and b are some positive numbers that measure the “value” of goods 
1 and 2 to the consumer. Note that the slope of a typical indifference curve 
is given by —a/b. 
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Perfect Complements 

This is the left shoe-right shoe case. In these preferences the consumer only 
cares about the number of pairs of shoes he has, so it is natural to choose 
the number of pairs of shoes as the utility function. The number of complete 
pairs of shoes that you have is the minimum of the number of right shoes 
you have, X\, and the number of left shoes you have, X2■ Thus the utility 
function for perfect complements takes the form u{x 1,12) = min{xi,X2}. 

To verify that this utility function actually works, pick a bundle of goods 
such as ( 10 , 10 ). If we add one more unit of good 1 we get ( 11 , 10 ), 
which should leave us on the same indifference curve. Does it? Yes, since 
min{10,10} = minjll, 10} = 10. 

So u(x 1,2:2) = min{a;i, 2:2} is a possible utility function to describe per¬ 
fect complements. As usual, any monotonic transformation would be suit¬ 
able as well. 

What about the case where the consumer wants to consume the goods 
in some proportion other than one-to-one? For example, what about the 
consumer who always uses 2 teaspoons of sugar with each cup of tea? If x\ 
is the number of cups of tea available and 12 is the number of teaspoons 
of sugar available, then the number of correctly sweetened cups of tea will 
be min{xi, ^a^}- 

This is a little tricky so we should stop to think about it. If the number 
of cups of tea is greater than half the number of teaspoons of sugar, then 
we know that we won’t be able to put 2 teaspoons of sugar in each cup. 
In this case, we will only end up with ^X2 correctly sweetened cups of tea. 
(Substitute some numbers in for x\ and X2 to convince yourself.) 

Of course, any monotonic transformation of this utility function will 
describe the same preferences. For example, we might want to multiply by 
2 to get rid of the fraction. This gives us the utility function u[x 1,2:2) = 
min{2a;i, 2 : 2 }. 

In general, a utility function that describes perfect-complement prefer¬ 
ences is given by 

u{x 1, X2) = min{a2;i, 62^}, 

where a and b are positive numbers that indicate the proportions in which 
the goods are consumed. 


Quasilinear Preferences 

Here’s a shape of indifference curves that we haven’t seen before. Suppose 
that a consumer has indifference curves that are vertical translates of one 
another, as in Figure 4 . 4 . This means that all of the indifference curves are 
just vertically “shifted” versions of one indifference curve. It follows that 
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the equation for an indifference curve takes the form X 2 = k — v(xi)j where 
k is a different constant for each indifference curve. This equation says that 
the height of each indifference curve is some function of x\, —v{x\) 1 plus a 
constant k. Higher values of k give higher indifference curves. (The minus 
sign is only a convention; we’ll see why it is convenient below.) 



Quasilinear preferences. Each indifference curve is a verti¬ 
cally shifted version of a single indifference curve. 


The natural way to label indifference curves here is with k- roughly 
speaking, the height of the indifference curve along the vertical axis. Solv¬ 
ing for k and setting it equal to utility, we have 

u(xi, X 2 ) = k = v{xi) + X2- 

In this case the utility function is linear in good 2, but (possibly) non¬ 
linear in good 1; hence the name quasilinear utility, meaning “partly 
linear” utility. Specific examples of quasilinear utility would be u{x 1 , X 2 ) = 
sjx\ + X 2 , or u(x 1 ,^ 2 ) = lnaq + X 2 ■ Quasilinear utility functions are not 
particularly realistic, but they are very easy to work with, as we’ll see in 
several examples later on in the book. 

Cobb-Douglas Preferences 

Another commonly used utility function is the Cobb-Douglas utility func¬ 
tion 

u(xi, X 2 ) = x\x 2 , 


Figure 

4.4 
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Figure 

4.5 


where c and d are positive numbers that describe the preferences of the 
consumer. 

The Cobb-Douglas utility function will be useful in several examples. 
The preferences represented by the Cobb-Douglas utility function have the 
general shape depicted in Figure 4.5. In Figure 4.5A, we have illustrated the 
indifference curves for c = 1/2, d = 1/2. In Figure 4.5B, we have illustrated 
the indifference curves for c = 1/5, d — 4/5. Note how different values of 
the parameters c and d lead to different shapes of the indifference curves. 




Cobb-Douglas indifference curves. Panel A shows the case 
where c = 1/2, d = 1/2 and panel B shows the case where 
c = 1/5, d = 4/5. 


Cobb-Douglas indifference curves look just like the nice convex mono¬ 
tonic indifference curves that we referred to as “well-behaved indifference 
curves” in Chapter 3. Cobb-Douglas preferences are the standard exam¬ 
ple of indifference curves that look well-behaved, and in fact the formula 
describing them is about the simplest algebraic expression that generates 
well-behaved preferences. We’ll find Cobb-Douglas preferences quite useful 
to present algebraic examples of the economic ideas we’ll study later. 

Of course a monotonic transformation of the Cobb-Douglas utility func¬ 
tion will represent exactly the same preferences, and it is useful to see a 
couple of examples of these transformations. 

2 Paul Douglas was a twentieth-century economist at the University of Chicago who 
later became a U.S. senator. Charles Cobb was a mathematician at Amherst College. 
The Cobb-Douglas functional form was originally used to study production behavior. 
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First, if we take the natural log of utility, the product of the terms will 
become a sum so that we have 


v(xi,X 2 ) = ln(xiX$) = cln x\ + din X 2 - 

The indifference curves for this utility function will look just like the ones 
for the first Cobb-Douglas function, since the logarithm is a monotonic 
transformation. (For a brief review of natural logarithms, see the Mathe¬ 
matical Appendix at the end of the book.) 

For the second example, suppose that we start with the Cobb-Douglas 
form 

v(xi,x 2 ) = x\x%. 

Then raising utility to the l/(c+ d) power, we have 

c d 

c+d c+d 
^ 1 J '2 

Now define a new number 

c 

a = - 

c+d 

We can now write our utility function as 

v(x 1 ,x 2 ) = x1x\~ a . 

This means that we can always take a monotonic transformation of the 
Cobb-Douglas utility function that make the exponents sum to 1. This 
will turn out to have a useful interpretation later on. 

The Cobb-Douglas utility function can be expressed in a variety of ways; 
you should learn to recognize them, as this family of preferences is very 
useful for examples. 


4.4 Marginal Utility 


Consider a consumer who is consuming some bundle of goods, (xi,^)- 
How does this consumer’s utility change as we give him or her a little more 
of good 1? This rate of change is called the marginal utility with respect 
to good 1. We write it as MU\ and think of it as being a ratio, 


M£/i 


A U 

Aa?i 


u(x i + Axi, x 2 ) — u(x i, x 2 ) 
Axi 


that measures the rate of change in utility (A U) associated with a small 
change in the amount of good 1 (Axi). Note that the amount of good 2 is 
held fixed in this calculation. 3 


3 


See the appendix to this chapter for a calculus treatment of marginal utility. 
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This definition implies that to calculate the change in utility associated 
with a small change in consumption of good 1, we can just multiply the 
change in consumption by the marginal utility of the good: 


AU = MU 1 Ax 1 . 


The marginal utility with respect to good 2 is defined in a similar manner: 


u(xi,x 2 + Ax 2 ) - u(x i, x 2 ) 



Ax 2 


Note that when we compute the marginal utility with respect to good 2 we 
keep the amount of good 1 constant. We can calculate the change in utility 
associated with a change in the consumption of good 2 by the formula 


AU = MU 2 Ax 2 . 


It is important to realize that the magnitude of marginal utility depends 
on the magnitude of utility. Thus it depends on the particular way that we 
choose to measure utility. If we multiplied utility by 2, then marginal utility 
would also be multiplied by 2. We would still have a perfectly valid utility 
function in that it would represent the same preferences, but it would just 
be scaled differently. 

This means that marginal utility itself has no behavioral content. How 
can we calculate marginal utility from a consumer’s choice behavior? We 
can’t. Choice behavior only reveals information about the way a consumer 
ranks different bundles of goods. Marginal utility depends on the partic¬ 
ular utility function that we use to reflect the preference ordering and its 
magnitude has no particular significance. However, it turns out that mar¬ 
ginal utility can be used to calculate something that does have behavioral 
content, as we will see in the next section. 


4.5 Marginal Utility and MRS 


A utility function u(x\,x 2 ) can be used to measure the marginal rate of 
substitution (MRS) defined in Chapter 3. Recall that the MRS measures 
the slope of the indifference curve at a given bundle of goods; it can be 
interpreted as the rate at which a consumer is just willing to substitute a 
small amount of good 2 for good 1. 

This interpretation gives us a simple way to calculate the MRS. Con¬ 
sider a change in the consumption of each good, (Aa;i,Aa; 2 ), that keeps 
utility constant—that is, a change in consumption that moves us along the 
indifference curve. Then we must have 


MHiAxi + MU 2 Ax 2 = AU = 0. 
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Solving for the slope of the indifference curve we have 


MRS = 


Ax 2 

Axi 


MU i 

mu 2 


(4.1) 


(Note that we have 2 over 1 on the left-hand side of the equation and 1 
over 2 on the right-hand side. Don’t get confused!) 

The algebraic sign of the MRS is negative: if you get more of good 1 you 
have to get less of good 2 in order to keep the same level of utility. However, 
it gets very tedious to keep track of that pesky minus sign, so economists 
often refer to the MRS by its absolute value—that is, as a positive number. 
We’ll follow this convention as long as no confusion will result. 

Now here is the interesting thing about the MRS calculation: the MRS 
can be measured by observing a person’s actual behavior—we find that 
rate of exchange where he or she is just willing to stay put, as described in 
Chapter 3. 

The utility function, and therefore the marginal utility function, is not 
uniquely determined. Any monotonic transformation of a utility function 
leaves you with another equally valid utility function. Thus, if we multiply 
utility by 2, for example, the marginal utility is multiplied by 2. Thus the 
magnitude of the marginal utility function depends on the choice of utility 
function, which is arbitrary. It doesn’t depend on behavior alone; instead 
it depends on the utility function that we use to describe behavior. 

But the ratio of marginal utilities gives us an observable magnitude— 
namely the marginal rate of substitution. The ratio of marginal utilities 
is independent of the particular transformation of the utility function you 
choose to use. Look at what happens if you multiply utility by 2. The 
MRS becomes 


MRS = — 


2MUi 
2 MU 2 ' 


The 2s just cancel out, so the MRS remains the same. 

The same sort of thing occurs when we take any monotonic transforma¬ 
tion of a utility function. Taking a monotonic transformation is just rela¬ 
beling the indifference curves, and the calculation for the MRS described 
above is concerned with moving along a given indifference curve. Even 
though the marginal utilities are changed by monotonic transformations, 
the ratio of marginal utilities is independent of the particular way chosen 
to represent the preferences. 


4.6 Utility for Commuting 

Utility functions are basically ways of describing choice behavior: if a bun¬ 
dle of goods X is chosen when a bundle of goods Y is available, then X 
must have a higher utility than Y. By examining choices consumers make 
we can estimate a utility function to describe their behavior. 
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This idea has been widely applied in the field of transportation economics 
to study consumers’ commuting behavior. In most large cities commuters 
have a choice between taking public transit or driving to work. Each of 
these alternatives can be thought of as representing a bundle of different 
characteristics: travel time, waiting time, out-of-pocket costs, comfort, con¬ 
venience, and so on. We could let X\ be the amount of travel time involved 
in each kind of transportation, x 2 the amount of waiting time for each kind, 
and so on. 

If (xi, X2, ■ ■ ■, x n ) represents the values of n different characteristics of 
driving, say, and (t/i, y 2 , • ■ •, y n ) represents the values of taking the bus, we 
can consider a model where the consumer decides to drive or take the bus 
depending on whether he prefers one bundle of characteristics to the other. 

More specifically, let us suppose that the average consumer’s preferences 
for characteristics can be represented by a utility function of the form 

U(xi,X 2 ,...,X n ) = PlXl + P2X2 H- \~ Pn^n, 

where the coefficients Pi,p 2 , and so on are unknown parameters. Any 
monotonic transformation of this utility function would describe the choice 
behavior equally well, of course, but the linear form is especially easy to 
work with from a statistical point of view. 

Suppose now that we observe a number of similar consumers making 
choices between driving and taking the bus based on the particular pattern 
of commute times, costs, and so on that they face. There are statistical 
techniques that can be used to find the values of the coefficients Pi for i = 
1 that best fit the observed pattern of choices by a set of consumers. 

These statistical techniques give a way to estimate the utility function for 
different transportation modes. 

One study reports a utility function that had the form 4 

U(TW, TT, C) = —0.147 TW - 0.0411TT - 2.24C, (4.2) 

where 

TW = total walking time to and from bus or car 
TT = total time of trip in minutes 
C = total cost of trip in dollars 

The estimated utility function in the Domenich-McFadden book correctly 
described the choice between auto and bus transport for 93 percent of the 
households in their sample. 

4 See Thomas Domenich and Daniel McFadden, Urban Travel Demand (North-Holland 
Publishing Company, 1975). The estimation procedure in this book also incorporated 
various demographic characteristics of the households in addition to the purely eco¬ 
nomic variables described here. Daniel McFadden was awarded the Nobel Prize in 
economics in 2000 for his work in developing techniques to estimate models of this 
sort. 
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The coefficients on the variables in Equation (4.2) describe the weight 
that an average household places on the various characteristics of their 
commuting trips; that is, the marginal utility of each characteristic. The 
ratio of one coefficient to another measures the marginal rate of substitu¬ 
tion between one characteristic and another. For example, the ratio of the 
marginal utility of walking time to the marginal utility of total time indi¬ 
cates that walking time is viewed as being roughly 3 times as onerous as 
travel time by the average consumer. In other words, the consumer would 
be willing to substitute 3 minutes of additional travel time to save 1 minute 
of walking time. 

Similarly, the ratio of cost to travel time indicates the average consumer’s 
tradeoff between these two variables. In this study, the average commuter 
valued a minute of commute time at 0.0411/2.24 = 0.0183 dollars per 
minute, which is $1.10 per hour. For comparison, the hourly wage for the 
average commuter in 1967, the year of the study, was about $2.85 an hour. 

Such estimated utility functions can be very valuable for determining 
whether or not it is worthwhile to make some change in the public trans¬ 
portation system. For example, in the above utility function one of the 
significant factors explaining mode choice is the time involved in taking 
the trip. The city transit authority can, at some cost, add more buses to 
reduce this travel time. But will the number of extra riders warrant the 
increased expense? 

Given a utility function and a sample of consumers we can forecast which 
consumers will drive and which consumers will choose to take the bus. This 
will give us some idea as to whether the revenue will be sufficient to cover 
the extra cost. 

Furthermore, we can use the marginal rate of substitution to estimate 
the value that each consumer places on the reduced travel time. We saw 
above that in the Domenich-McFadden study the average commuter in 
1967 valued commute time at a rate of $1.10 per hour. Thus the commuter 
should be willing to pay about $0.37 to cut 20 minutes from his or her 
trip. This number gives us a measure of the dollar benefit of providing 
more timely bus service. This benefit must be compared to the cost of 
providing more timely bus service in order to determine if such provision 
is worthwhile. Having a quantitative measure of benefit will certainly be 
helpful in making a rational decision about transport policy. 


Summary 

1. A utility function is simply a way to represent or summarize a prefer¬ 
ence ordering. The numerical magnitudes of utility levels have no intrinsic 
meaning. 

2. Thus, given any one utility function, any monotonic transformation of 
it will represent the same preferences. 
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3. The marginal rate of substitution, MRS, can be calculated from the 
utility function via the formula MRS = Aa ^/Axi = —MUx/MU^- 

REVIEW QUESTIONS 


1. The text said that raising a number to an odd power was a monotonic 
transformation. What about raising a number to an even power? Is this a 
monotonic transformation? (Hint: consider the case f(u) = u 2 .) 

2. Which of the following are monotonic transformations? (1) u = 2v — 13; 
(2) u = —1/u 2 ; (3) u = 1/u 2 ; (4) u = lnv; (5) u = —e~ v \ (6) u = v 2 \ 
(7) u = v 2 for v > 0; (8) u = v 2 for v < 0. 

3. We claimed in the text that if preferences were monotonic, then a diag¬ 
onal line through the origin would intersect each indifference curve exactly 
once. Can you prove this rigorously? (Hint: what would happen if it 
intersected some indifference curve twice?) 

4. What kind of preferences are represented by a utility function of the 
form u(xi,X 2 ) = \Jx\ + What about the utility function v(xi,X 2 ) = 
13xi + 13a;2? 

5. What kind of preferences are represented by a utility function of the form 
u{x i, X 2 ) = Xi + y/x 2 ? Is the utility function v(xi,X 2 ) = x 2 + 2x\y/x2 + X 2 
a monotonic transformation of u(x 1 , 2 : 2 )? 

6. Consider the utility function u(x 1 , 0 : 2 ) = x/%iX 2 - What kind of pref¬ 
erences does it represent? Is the function v(xi,X 2 ) = x\x 2 a monotonic 
transformation of u(x 1 , 2 : 2 )? Is the function w(x\,X 2 ) = x\x\ a monotonic 
transformation of u(x 1 , 2 : 2 )? 

7. Can you explain why taking a monotonic transformation of a utility 
function doesn’t change the marginal rate of substitution? 


APPENDIX 

First, let us clarify what is meant by “marginal utility.” As elsewhere in eco¬ 
nomics, “marginal” just means a derivative. So the marginal utility of good 1 is 
just 

,, rr u(x 1 +Ax 1 ,X2)-u(x 1 ,x 2 ) du( Xl,X 2 ) 

1VL U 1 = nm --- = ---. 

Axi^O /\Xi OX 1 

Note that we have used the partial derivative here, since the marginal utility 
of good 1 is computed holding good 2 fixed. 
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Now we can rephrase the derivation of the MRS given in the text using calculus. 
We’ll do it two ways: first by using differentials, and second by using implicit 
functions. 

For the first method, we consider making a change (dx 1, dx 2) that keeps utility 
constant. So we want 


du = 


8 u( £l, £2) 


dx 1 + 


du(x 1, *2) 


dx 2 = 0 . 


dxi 8x2 

The first term measures the increase in utility from the small change dx 1, and 
the second term measures the increase in utility from the small change dx 2. We 
want to pick these changes so that the total change in utility, du, is zero. Solving 
for dx2/dxi gives us 


dx 2 du(xi,X2)/dxi 

dx 1 du(xi,X2)/dx2 : 

which is just the calculus analog of equation ( 4 . 1 ) in the text. 

As for the second method, we now think of the indifference curve as being 
described by a function £2(2:1). That is, for each value of xi, the function £2(2:1) 
tells us how much £2 we need to get on that specific indifference curve. Thus the 
function £2 (£i) has to satisfy the identity 


u(x 1, £ 2 (£i)) = k, 


where k is the utility label of the indifference curve in question. 

We can differentiate both sides of this identity with respect to £1 to get 
du(x\ , £2) du(x 1,3:2) dxijxi) _ Q 

dxi dx 2 dxi 

Notice that £1 occurs in two places in this identity, so changing £1 will change 
the function in two ways, and we have to take the derivative at each place that 
£1 appears. 

We then solve this equation for dx 2 (xi) / dx\ to find 
dx 2 (xi) _ du(xi,x 2 )/dxi 
dxi du(xi, X 2 )/dx 2 ’ 

just as we had before. 

The implicit function method is a little more rigorous, but the differential 
method is more direct, as long as you don’t do something silly. 

Suppose that we take a monotonic transformation of a utility function, say, 
v(x\ ,£2) = /(rt(£i,£2))- Let’s calculate the MRS for this utility function. Using 
the chain rule 

MRS = dv / dxi = df/du du/dx! 

dv/dx 2 df/du du/dx 2 
du/dx 1 
du/dx 2 

since the df/du term cancels out from both the numerator and denominator. 
This shows that the MRS is independent of the utility representation. 

This gives a useful way to recognize preferences that are represented by dif¬ 
ferent utility functions: given two utility functions, just compute the marginal 
rates of substitution and see if they are the same. If they are, then the two 
utility functions have the same indifference curves. If the direction of increasing 
preference is the same for each utility function, then the underlying preferences 
must be the same. 
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EXAMPLE: Cobb-Douglas Preferences 

The MRS for Cobb-Douglas preferences is easy to calculate by using the formula 
derived above. 

If we choose the log representation where 

u(x 1, £2) = clna:i + d \nx 2 , 


then we have 


MRS = - 


du(x i,x 2 )/dxi 
du(x l,X2)/dX2 
c/xi 
d/x 2 


C X 2 
d xi' 


Note that the MRS only depends on the ratio of the two parameters and the 
quantity of the two goods in this case. 

What if we choose the exponent representation where 


U(X 1 ,X 2 ) = X1X2 ? 


Then we have 


MRS = - 


du(x i,x 2 )/dx 
du(x i,x 2 )/dx 


dx\x2 1 

cx 2 


1 


2 


dx 1 ’ 


which is the same as we had before. Of course you knew all along that a monotonic 
transformation couldn’t change the marginal rate of substitution! 






CHAPTER <J 

CHOICE 


In this chapter we will put together the budget set and the theory of prefer¬ 
ences in order to examine the optimal choice of consumers. We said earlier 
that the economic model of consumer choice is that people choose the best 
bundle they can afford. We can now rephrase this in terms that sound more 
professional by saying that “consumers choose the most preferred bundle 
from their budget sets.” 


5.1 Optimal Choice 

A typical case is illustrated in Figure 5.1. Here we have drawn the budget 
set and several of the consumer’s indifference curves on the same diagram. 
We want to find the bundle in the budget set that is on the highest indif¬ 
ference curve. Since preferences are well-behaved, so that more is preferred 
to less, we can restrict our attention to bundles of goods that lie on the 
budget line and not worry about those beneath the budget line. 

Now simply start at the right-hand corner of the budget line and move to 
the left. As we move along the budget line we note that we are moving to 
higher and higher indifference curves. We stop when we get to the highest 
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Figure 

5.1 


indifference curve that just touches the budget line. In the diagram, the 
bundle of goods that is associated with the highest indifference curve that 
just touches the budget line is labeled 

The choice is an optimal choice for the consumer. The set 

of bundles that she prefers to (x*,x|)—the set of bundles above her indif¬ 
ference curve—doesn’t intersect the bundles she can afford -the bundles 
beneath her budget line. Thus the bundle (xjyxij) is the best bundle that 
the consumer can afford. 



Optimal choice. The optimal consumption position is where 
the indifference curve is tangent to the budget line. 


Note an important feature of this optimal bundle: at this choice, the 
indifference curve is tangent to the budget line. If you think about it a 
moment you’ll see that this has to be the case: if the indifference curve 
weren’t tangent, it would cross the budget line, and if it crossed the budget 
line, there would be some nearby point on the budget line that lies above 
the indifference curve—which means that we couldn’t have started at an 
optimal bundle. 








OPTIMAL CHOICE 75 


Does this tangency condition really have to hold at an optimal choice? 
Well, it doesn’t hold in all cases, but it does hold for most interesting cases. 
What is always true is that at the optimal point the indifference curve can’t 
cross the budget line. So when does “not crossing” imply tangent? Let’s 
look at the exceptions first. 

First, the indifference curve might not have a tangent line, as in Fig¬ 
ure 5.2. Here the indifference curve has a kink at the optimal choice, and 
a tangent just isn’t defined, since the mathematical definition of a tangent 
requires that there be a unique tangent line at each point. This case doesn’t 
have much economic significance—it is more of a nuisance than anything 
else. 



Kinky tastes. Here is an optimal consumption bundle where 
the indifference curve doesn’t have a tangent. 


The second exception is more interesting. Suppose that the optimal 
point occurs where the consumption of some good is zero as in Figure 5.3. 
Then the slope of the indifference curve and the slope of the budget line 
are different, but the indifference curve still doesn’t cross the budget line. 


Figure 

5.2 
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Figure 

5.3 


We say that Figure 5.3 represents a boundary optimum, while a case 
like Figure 5.1 represents an interior optimum. 

If we are willing to rule out “kinky tastes” we can forget about the 
example given in Figure 5.2. 1 And if we are willing to restrict ourselves only 
to interior optima, we can rule out the other example. If we have an interior 
optimum with smooth indifference curves, the slope of the indifference curve 
and the slope of the budget line must be the same ... because if they were 
different the indifference curve would cross the budget line, and we couldn’t 
be at the optimal point. 



Boundary optimum. The optimal consumption involves con¬ 
suming zero units of good 2. The indifference curve is not tan¬ 
gent to the budget line. 


We’ve found a necessary condition that the optimal choice must satisfy. 
If the optimal choice involves consuming some of both goods—so that it is 
an interior optimum—then necessarily the indifference curve will be tangent 
to the budget line. But is the tangency condition a sufficient condition for 
a bundle to be optimal? If we find a bundle where the indifference curve 
is tangent to the budget line, can we be sure we have an optimal choice? 

Look at Figure 5.4. Here we have three bundles where the tangency 
condition is satisfied, all of them interior, but only two of them are optimal. 

1 Otherwise, this book might get an R rating. 
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So in general, the tangency condition is only a necessary condition for 
optimality, not a sufficient condition. 



More than one tangency. Here there are three tangencies, 
but only two optimal points, so the tangency condition is nec¬ 
essary but not sufficient. 


However, there is one important case where it is sufficient: the case 
of convex preferences. In the case of convex preferences, any point that 
satisfies the tangency condition must be an optimal point. This is clear 
geometrically: since convex indifference curves must curve away from the 
budget line, they can’t bend back to touch it again. 

Figure 5.4 also shows us that in general there may be more than one 
optimal bundle that satisfies the tangency condition. However, again con¬ 
vexity implies a restriction. If the indifference curves are strictly convex— 
they don’t have any flat spots—then there will be only one optimal choice 
on each budget line. Although this can be shown mathematically, it is also 
quite plausible from looking at the figure. 

The condition that the MRS must equal the slope of the budget line at 
an interior optimum is obvious graphically, but what does it mean econom¬ 
ically? Recall that one of our interpretations of the MRS is that it is that 
rate of exchange at which the consumer is just willing to stay put. Well, 
the market is offering a rate of exchange to the consumer of —p\/p 2 —if 


Figure 

5.4 
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you give up one unit of good 1, you can buy pi/p 2 units of good 2. If the 
consumer is at a consumption bundle where he or she is willing to stay put, 
it must be one where the MRS is equal to this rate of exchange: 

MRS = - —. 

Pi 

Another way to think about this is to imagine what would happen if the 
MRS were different from the price ratio. Suppose, for example, that the 
MRS is Aa^/Aaq = —1/2 and the price ratio is 1/1. Then this means the 
consumer is just willing to give up 2 units of good 1 in order to get 1 unit of 
good 2—but the market is willing to exchange them on a one-to-one basis. 
Thus the consumer would certainly be willing to give up some of good 1 in 
order to purchase a little more of good 2. Whenever the MRS is different 
from the price ratio, the consumer cannot be at his or her optimal choice. 


5.2 Consumer Demand 

The optimal choice of goods 1 and 2 at some set of prices and income is 
called the consumer’s demanded bundle. In general when prices and 
income change, the consumer’s optimal choice will change. The demand 
function is the function that relates the optimal choice—the quantities 
demanded—to the different values of prices and incomes. 

We will write the demand functions as depending on both prices and 
income: Xi(pi,p 2 , m) and X 2 {pi,P 2 , tti). For each different set of prices and 
income, there will be a different combination of goods that is the optimal 
choice of the consumer. Different preferences will lead to different demand 
functions; we’ll see some examples shortly. Our major goal in the next 
few chapters is to study the behavior of these demand functions—how the 
optimal choices change as prices and income change. 


5.3 Some Examples 

Let us apply the model of consumer choice we have developed to the exam¬ 
ples of preferences described in Chapter 3. The basic procedure will be the 
same for each example: plot the indifference curves and budget line and 
find the point where the highest indifference curve touches the budget line. 


Perfect Substitutes 

The case of perfect substitutes is illustrated in Figure 5.5. We have three 
possible cases. If P2 > Pi, then the slope of the budget line is flatter than 
the slope of the indifference curves. In this case, the optimal bundle is 
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where the consumer spends all of his or her money on good 1. If p\ > p 2 , 
then the consumer purchases only good 2. Finally, if pi = p 2 , there is a 
whole range of optimal choices—any amount of goods 1 and 2 that satisfies 
the budget constraint is optimal in this case. Thus the demand function 
for good 1 will be 

{ m/pi when pi < p 2 ; 

any number between 0 and m/p\ when p\ = p 2 ; 

0 when pi > p 2 . 

Are these results consistent with common sense? All they say is that 
if two goods are perfect substitutes, then a consumer will purchase the 
cheaper one. If both goods have the same price, then the consumer doesn’t 
care which one he or she purchases. 



Optimal choice with perfect substitutes. If the goods are 
perfect substitutes, the optimal choice will usually be on the 
boundary. 


Perfect Complements 

The case of perfect complements is illustrated in Figure 5.6. Note that 
the optimal choice must always lie on the diagonal, where the consumer is 
purchasing equal amounts of both goods, no matter what the prices are. 


Figure 

5.5 
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In terms of our example, this says that people with two feet buy shoes in 
pairs. 2 

Let us solve for the optimal choice algebraically. We know that this 
consumer is purchasing the same amount of good 1 and good 2, no matter 
what the prices. Let this amount be denoted by x. Then we have to satisfy 
the budget constraint 

Pix + P2X = to . 


Solving for x gives us the optimal choices of goods 1 and 2: 

TO 

X\ = X 2 = x = -. 

Pi +P2 

The demand function for the optimal choice here is quite intuitive. Since 
the two goods are always consumed together, it is just as if the consumer 
were spending all of her money on a single good that had a price of Pi+P 2 - 


Figure 

5.6 
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*1 

Optimal choice with perfect complements. If the goods 
are perfect complements, the quantities demanded will always 
lie on the diagonal since the optimal choice occurs where x\ 
equals X 2 ■ 


2 


Don’t worry, we’ll get some more exciting results later on. 
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Figure 

5.7 


Neutrals and Bads 

In the case of a neutral good the consumer spends all of her money on the 
good she likes and doesn’t purchase any of the neutral good. The same 
thing happens if one commodity is a bad. Thus, if commodity 1 is a good 
and commodity 2 is a bad, then the demand functions will be 

TO 

Xi = — 

Pi 

x 2 = 0. 




Discrete goods. In panel A the demand for good 1 is zero, 
while in panel B one unit will be demanded. 


Discrete Goods 

Suppose that good 1 is a discrete good that is available only in integer 
units, while good 2 is money to be spent on everything else. If the con¬ 
sumer chooses 1,2, 3, ••• units of good 1, she will implicitly choose the 
consumption bundles (1 ,to — p\), (2,to — 2pi), (3, to — 3pi), and so on. We 
can simply compare the utility of each of these bundles to see which has 
the highest utility. 

Alternatively, we can use the indifference-curve analysis in Figure 5.7. As 
usual, the optimal bundle is the one on the highest indifference “curve.” If 
the price of good 1 is very high, then the consumer will choose zero units 
of consumption; as the price decreases the consumer will find it optimal to 
consume 1 unit of the good. Typically, as the price decreases further the 
consumer will choose to consume more units of good 1. 
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Concave Preferences 


Consider the situation illustrated in Figure 5.8. Is X the optimal choice? 
No! The optimal choice for these preferences is always going to be a bound¬ 
ary choice, like bundle Z. Think of what nonconvex preferences mean. If 
you have money to purchase ice cream and olives, and you don’t like to 
consume them together, you’ll spend all of your money on one or the other. 


Figure 

5.8 



Optimal choice with concave preferences. The optimal 
choice is the boundary point, Z, not the interior tangency point, 
X , because Z lies on a higher indifference curve. 


Cobb-Douglas Preferences 


Suppose that the utility function is of the Cobb-Douglas form, u{x i, £ 2 ) = 
x\x 2 - In the Appendix to this chapter we use calculus to derive the optimal 
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choices for this utility function. They turn out to be 

c TO 
11 = — 

c + dp i 

d to 

X2 = ““ — • 
c + dp 2 

These demand functions are often useful in algebraic examples, so you 
should probably memorize them. 

The Cobb-Douglas preferences have a convenient property. Consider the 
fraction of his income that a Cobb-Douglas consumer spends on good 1. If 
he consumes x\ units of good 1, this costs him piXi, so this represents a 
fraction piXi/m of total income. Substituting the demand function for x\ 
we have 

PiXi Pi c to c 

to to c + dpi c + d 

Similarly the fraction of his income that the consumer spends on good 2 is 

d/(c + d). 

Thus the Cobb-Douglas consumer always spends a fixed fraction of his 
income on each good. The size of the fraction is determined by the exponent 
in the Cobb-Douglas function. 

This is why it is often convenient to choose a representation of the Cobb- 
Douglas utility function in which the exponents sum to 1. If u(x 1 , 2 : 2 ) = 
x i x 2 ~ a > then we can immediately interpret a as the fraction of income spent 
on good 1. For this reason we will usually write Cobb-Douglas preferences 
in this form. 


5.4 Estimating Utility Functions 

We’ve now seen several different forms for preferences and utility functions 
and have examined the kinds of demand behavior generated by these pref¬ 
erences. But in real life we usually have to work the other way around: we 
observe demand behavior, but our problem is to determine what kind of 
preferences generated the observed behavior. 

For example, suppose that we observe a consumer’s choices at several 
different prices and income levels. An example is depicted in Table 5.1. 
This is a table of the demand for two goods at the different levels of prices 
and incomes that prevailed in different years. We have also computed 
the share of income spent on each good in each year using the formulas 
Si = piXi/m and S 2 = P 2 x 2 /m. 

For these data, the expenditure shares are relatively constant. There are 
small variations from observation to observation, but they probably aren’t 
large enough to worry about. The average expenditure share for good 1 is 
about 1/4, and the average income share for good 2 is about 3/4. It appears 
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Table 

5.1 


Some data describing consumption behavior. 


Year 

Pi 

P2 

m 

Xl 

X2 

Sl 

S2 

Utility 

1 

1 

1 

100 

25 

75 

.25 

.75 

57.0 

2 

1 

2 

100 

24 

38 

.24 

.76 

33.9 

3 

2 

1 

100 

13 

74 

.26 

.74 

47.9 

4 

1 

2 

200 

48 

76 

.24 

.76 

67.8 

5 

2 

1 

200 

25 

150 

.25 

.75 

95.8 

6 

1 

4 

400 

100 

75 

.25 

.75 

80.6 

7 

4 

1 

400 

24 

304 

.24 

.76 

161.1 


1 3 

that a utility function of the form u(x\,X 2 ) = x*x^ seems to fit these 
data pretty well. That is, a utility function of this form would generate 
choice behavior that is pretty close to the observed choice behavior. For 
convenience we have calculated the utility associated with each observation 
using this estimated Cobb-Douglas utility function. 

As far as we can tell from the observed behavior it appears as though the 

I 2 

consumer is maximizing the function u(x i, £ 2 ) = xfx^- It may well be that 
further observations on the consumer’s behavior would lead us to reject this 
hypothesis. But based on the data we have, the fit to the optimizing model 
is pretty good. 

This has very important implications, since we can now use this “fitted” 
utility function to evaluate the impact of proposed policy changes. Suppose, 
for example, that the government was contemplating imposing a system of 
taxes that would result in this consumer facing prices (2,3) and having an 
income of 200. According to our estimates, the demanded bundle at these 
prices would be 


1 200 

3200 50 

* 2 = 4 ^ = 5 °- 


The estimated utility of this bundle is 


u(xi,x 2 ) = 25^50^ ~ 42. 

This means that the new tax policy would make the consumer better off 
than he was in year 2, but worse off than he was in year 3. Thus we can use 
the observed choice behavior to value the implications of proposed policy 
changes on this consumer. 

Since this is such an important idea in economics, let us review the 
logic one more time. Given some observations on choice behavior, we try 
to determine what, if anything, is being maximized. Once we have an 
estimate of what it is that is being maximized, we can use this both to 
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predict choice behavior in new situations and to evaluate proposed changes 
in the economic environment. 

Of course we have described a very simple situation. In reality, we nor¬ 
mally don’t have detailed data on individual consumption choices. But we 
often have data on groups of individuals—teenagers, middle-class house¬ 
holds, elderly people, and so on. These groups may have different prefer¬ 
ences for different goods that are reflected in their patterns of consumption 
expenditure. We can estimate a utility function that describes their con¬ 
sumption patterns and then use this estimated utility function to forecast 
demand and evaluate policy proposals. 

In the simple example described above, it was apparent that income 
shares were relatively constant so that the Cobb-Douglas utility function 
would give us a pretty good fit. In other cases, a more complicated form 
for the utility function would be appropriate. The calculations may then 
become messier, and we may need to use a computer for the estimation, 
but the essential idea of the procedure is the same. 


5.5 Implications of the MRS Condition 

In the last section we examined the important idea that observation of de¬ 
mand behavior tells us important things about the underlying preferences 
of the consumers that generated that behavior. Given sufficient observa¬ 
tions on consumer choices it will often be possible to estimate the utility 
function that generated those choices. 

But even observing one consumer choice at one set of prices will allow 
us to make some kinds of useful inferences about how consumer utility will 
change when consumption changes. Let us see how this works. 

In well-organized markets, it is typical that everyone faces roughly the 
same prices for goods. Take, for example, two goods like butter and milk. 
If everyone faces the same prices for butter and milk, and everyone is 
optimizing, and everyone is at an interior solution ... then everyone must 
have the same marginal rate of substitution for butter and milk. 

This follows directly from the analysis given above. The market is offer¬ 
ing everyone the same rate of exchange for butter and milk, and everyone 
is adjusting their consumption of the goods until their own “internal” mar¬ 
ginal valuation of the two goods equals the market’s “external” valuation 
of the two goods. 

Now the interesting thing about this statement is that it is independent 
of income and tastes. People may value their total consumption of the two 
goods very differently. Some people may be consuming a lot of butter and 
a little milk, and some may be doing the reverse. Some wealthy people 
may be consuming a lot of milk and a lot of butter while other people may 
be consuming just a little of each good. But everyone who is consuming 
the two goods must have the same marginal rate of substitution. Everyone 
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who is consuming the goods must agree on how much one is worth in terms 
of the other: how much of one they would be willing to sacrifice to get some 
more of the other. 

The fact that price ratios measure marginal rates of substitution is very 
important, for it means that we have a way to value possible changes in 
consumption bundles. Suppose, for example, that the price of milk is $1 
a quart and the price of butter is $2 a pound. Then the marginal rate of 
substitution for all people who consume milk and butter must be 2: they 
have to have 2 quarts of milk to compensate them for giving up 1 pound 
of butter. Or conversely, they have to have 1 pound of butter to make 
it worth their while to give up 2 quarts of milk. Hence everyone who is 
consuming both goods will value a marginal change in consumption in the 
same way. 

Now suppose that an inventor discovers a new way of turning milk into 
butter: for every 3 quarts of milk poured into this machine, you get out 
1 pound of butter, and no other useful byproducts. Question: is there 
a market for this device? Answer: the venture capitalists won’t beat a 
path to his door, that’s for sure. For everyone is already operating at a 
point where they are just willing to trade 2 quarts of milk for 1 pound 
of butter; why would they be willing to substitute 3 quarts of milk for 1 
pound of butter? The answer is they wouldn’t; this invention isn’t worth 
anything. 

But what would happen if he got it to run in reverse so he could dump 
in a pound of butter get out 3 quarts of milk? Is there a market for this 
device? Answer: yes! The market prices of milk and butter tell us that 
people are just barely willing to trade one pound of butter for 2 quarts of 
milk. So getting 3 quarts of milk for a pound of butter is a better deal than 
is currently being offered in the marketplace. Sign me up for a thousand 
shares! (And several pounds of butter.) 

The market prices show that the first machine is unprofitable: it produces 
$2 of butter by using $3 of milk. The fact that it is unprofitable is just 
another way of saying that people value the inputs more than the outputs. 
The second machine produces $3 worth of milk by using only $2 worth of 
butter. This machine is profitable because people value the outputs more 
than the inputs. 

The point is that, since prices measure the rate at which people are just 
willing to substitute one good for another, they can be used to value policy 
proposals that involve making changes in consumption. The fact that prices 
are not arbitrary numbers but reflect how people value things on the margin 
is one of the most fundamental and important ideas in economics. 

If we observe one choice at one set of prices we get the MRS at one 
consumption point. If the prices change and we observe another choice we 
get another MRS. As we observe more and more choices we learn more 
and more about the shape of the underlying preferences that may have 
generated the observed choice behavior. 
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5.6 Choosing Taxes 

Even the small bit of consumer theory we have discussed so far can be used 
to derive interesting and important conclusions. Here is a nice example 
describing a choice between two types of taxes. We saw that a quantity 
tax is a tax on the amount consumed of a good, like a gasoline tax of 
15 cents per gallon. An income tax is just a tax on income. If the 
government wants to raise a certain amount of revenue, is it better to raise 
it via a quantity tax or an income tax? Let’s apply what we’ve learned to 
answer this question. 

First we analyze the imposition of a quantity tax. Suppose that the 
original budget constraint is 


PiXi +P 2 X 2 = m. 

What is the budget constraint if we tax the consumption of good 1 at a 
rate of tl The answer is simple. From the viewpoint of the consumer it is 
just as if the price of good 1 has increased by an amount t. Thus the new 
budget constraint is 

(pi + t)x 1 + P 2 X 2 = to. (5.1) 

Therefore a quantity tax on a good increases the price perceived by 
the consumer. Figure 5.9 gives an example of how that price change might 
affect demand. At this stage, we don’t know for certain whether this tax will 
increase or decrease the consumption of good 1, although the presumption 
is that it will decrease it. Whichever is the case, we do know that the 
optimal choice, (x*,^), must satisfy the budget constraint 

(pi + t)x\ + p 2 x 2 = to. (5.2) 

The revenue raised by this tax is R* = tx\. 

Let’s now consider an income tax that raises the same amount of revenue. 
The form of this budget constraint would be 


P1X1 + P2X2 = to — R* 


or, substituting for R*, 


P 1 X 1 + P 2 X 2 = m — tx\. 


Where does this budget line go in Figure 5.9? 

It is easy to see that it has the same slope as the original budget line, 
~Pi/P 2 , but the problem is to determine its location. As it turns out, the 
budget line with the income tax must pass through the point (xjyxj). The 
way to check this is to plug (xjyx^) into the income-tax budget constraint 
and see if it is satisfied. 


88 CHOICE (Ch. 5) 


Figure 

5.9 



with quantity tax 
slope = -(p + t)/p 

Income tax versus a quantity tax. Here we consider a quan¬ 
tity tax that raises revenue R* and an income tax that raises 
the same revenue. The consumer will be better off under the 
income tax, since he can choose a point on a higher indifference 
curve. 


Is it true that 


Pi a;* + P 2 X 2 — m — tx\l 


Yes it is, since this is just a rearrangement of equation (5.2), which we 
know to be true. 

This establishes that (xi,x 2 ) lies on the income tax budget line: it is an 
affordable choice for the consumer. But is it an optimal choice? It is easy 
to see that the answer is no. At (x\,x 2 ) the MRS is —{pi + t)/p 2 ■ But the 
income tax allows us to trade at a rate of exchange of —p±/p 2 - Thus the 
budget line cuts the indifference curve at (xl,x 2 ), which implies that there 
will be some point on the budget line that will be preferred to (x*,x 2 ). 

Therefore the income tax is definitely superior to the quantity tax in 
the sense that you can raise the same amount of revenue from a consumer 
and still leave him or her better off under the income tax than under the 
quantity tax. 

This is a nice result, and worth remembering, but it is also worthwhile 
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understanding its limitations. First, it only applies to one consumer. The 
argument shows that for any given consumer there is an income tax that 
will raise as much money from that consumer as a quantity tax and leave 
him or her better off. But the amount of that income tax will typically differ 
from person to person. So a uniform income tax for all consumers is not 
necessarily better than a uniform quantity tax for all consumers. (Think 
about a case where some consumer doesn’t consume any of good 1—this 
person would certainly prefer the quantity tax to a uniform income tax.) 

Second, we have assumed that when we impose the tax on income the 
consumer’s income doesn’t change. We have assumed that the income tax 
is basically a lump sum tax—one that just changes the amount of money 
a consumer has to spend but doesn’t affect any choices he has to make. 
This is an unlikely assumption. If income is earned by the consumer, we 
might expect that taxing it will discourage earning income, so that after-tax 
income might fall by even more than the amount taken by the tax. 

Third, we have totally left out the supply response to the tax. We’ve 
shown how demand responds to the tax change, but supply will respond 
too, and a complete analysis would take those changes into account as well. 


Summary 

1. The optimal choice of the consumer is that bundle in the consumer’s 
budget set that lies on the highest indifference curve. 

2. Typically the optimal bundle will be characterized by the condition that 
the slope of the indifference curve (the MRS) will equal the slope of the 
budget line. 

3. If we observe several consumption choices it may be possible to estimate 
a utility function that would generate that sort of choice behavior. Such a 
utility function can be used to predict future choices and to estimate the 
utility to consumers of new economic policies. 

4. If everyone faces the same prices for the two goods, then everyone will 
have the same marginal rate of substitution, and will thus be willing to 
trade off the two goods in the same way. 


REVIEW QUESTIONS 


1. If two goods are perfect substitutes, what is the demand function for 
good 2? 
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2. Suppose that indifference curves are described by straight lines with a 
slope of —b. Given arbitrary prices and money income pi, p 2 > and m, what 
will the consumer’s optimal choices look like? 

3. Suppose that a consumer always consumes 2 spoons of sugar with each 
cup of coffee. If the price of sugar is p\ per spoonful and the price of coffee 
is P 2 per cup and the consumer has to dollars to spend on coffee and sugar, 
how much will he or she want to purchase? 

4. Suppose that you have highly nonconvex preferences for ice cream and 
olives, like those given in the text, and that you face prices pi, p 2 and have 
to dollars to spend. List the choices for the optimal consumption bundles. 

5. If a consumer has a utility function u(xi,a; 2 ) = x\x\, what fraction of 
her income will she spend on good 2? 

6. For what kind of preferences will the consumer be just as well-off facing 
a quantity tax as an income tax? 


APPENDIX 

It is very useful to be able to solve the preference-maximization problem and get 
algebraic examples of actual demand functions. We did this in the body of the 
text for easy cases like perfect substitutes and perfect complements, and in this 
Appendix we’ll see how to do it in more general cases. 

First, we will generally want to represent the consumer’s preferences by a utility 
function, u(x 1 , 12 ). We’ve seen in Chapter 4 that this is not a very restrictive 
assumption; most well-behaved preferences can be described by a utility function. 

The first thing to observe is that we already know how to solve the optimal- 
choice problem. We just have to put together the facts that we learned in the 
last three chapters. We know from this chapter that an optimal choice (£ 1 , 2 : 2 ) 
must satisfy the condition 


MRS(aJi,x 2 ) = — —, (5.3) 

P 2 

and we saw in the Appendix to Chapter 4 that the MRS can be expressed as the 
negative of the ratio of derivatives of the utility function. Making this substitution 
and cancelling the minus signs, we have 

du(xi,x 2 )/dxi _ pi 
du(xi,X2)/dX2 P2 

From Chapter 2 we know that the optimal choice must also satisfy the budget 
constraint 

piXi+p2X2 = m. (5.5) 

This gives us two equations—the MRS condition and the budget constraint— 
and two unknowns, xi and X 2 - All we have to do is to solve these two equations 
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to find the optimal choices of x\ and X2 as a function of the prices and income. 
There are a number of ways to solve two equations in two unknowns. One way 
that always works, although it might not always be the simplest, is to solve the 
budget constraint for one of the choices, and then substitute that into the MRS 
condition. 

Rewriting the budget constraint, we have 


m »i 

X2= - Xl 

p 2 P 2 


(5.6) 


and substituting this into equation ( 5 . 4 ) we get 


du{xi,m/p2 - (pi/p2)xi)/dx 1 _ pi 
du(xi,m/p2 - (pi/p2)xi)/dx 2 P2' 

This rather formidable looking expression has only one unknown variable, *1, 
and it can typically be solved for *1 in terms of (pi,p2,m). Then the budget 
constraint yields the solution for X2 as a function of prices and income. 

We can also derive the solution to the utility maximization problem in a more 
systematic way, using calculus conditions for maximization. To do this, we first 
pose the utility maximization problem as a constrained maximization problem: 


max u(x 1, £2) 

xi ,X2 


such that pi*i + P2X2 = m. 

This problem asks that we choose values of Xi and X2 that do two things: 
first, they have to satisfy the constraint, and second, they give a larger value for 
u(*i,*2) than any other values of xi and X2 that satisfy the constraint. 

There are two useful ways to solve this kind of problem. The first way is simply 
to solve the constraint for one of the variables in terms of the other and then 
substitute it into the objective function. 

For example, for any given value of xi, the amount of X2 that we need to 
satisfy the budget constraint is given by the linear function 

* 2 ( 2 : 1 ) = — - —xi. (5.7) 

P 2 P 2 

Now substitute *2(2:1) for X2 in the utility function to get the unconstrained 
maximization problem 


max«(ii,m/j)2 - (pi/p 2 )xi). 

xi 


This is an unconstrained maximization problem in 2:1 alone, since we have used 
the function *2(2:1) to ensure that the value of *2 will always satisfy the budget 
constraint, whatever the value of *1 is. 

We can solve this kind of problem just by differentiating with respect to *1 
and setting the result equal to zero in the usual way. This procedure will give us 
a first-order condition of the form 


< 9 m(*i,* 2 (*i)) 


+ 


du{x 1,*2(*i)) dx2 
< 9*2 dx 1 


= 0 . 


9*i 


(5.8) 
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Here the first term is the direct effect of how increasing xi increases utility. The 
second term consists of two parts: the rate of increase of utility as X2 increases, 
du/dx2, times dx2/dx\, the rate of increase of X2 as xi increases in order to 
continue to satisfy the budget equation. We can differentiate ( 5 . 7 ) to calculate 
this latter derivative 

dx2 _ pi 
dx 1 P2 

Substituting this into ( 5 . 8 ) gives us 

du(x*, xl)/dxi _ pi 
du{x1, xl)/dx2 P 2 

which just says that the marginal rate of substitution between xi and X2 must 
equal the price ratio at the optimal choice (xi,x 2 ). This is exactly the condition 
we derived above: the slope of the indifference curve must equal the slope of the 
budget line. Of course the optimal choice must also satisfy the budget constraint 
pi a:* +P2a;2 = to, which again gives us two equations in two unknowns. 

The second way that these problems can be solved is through the use of La¬ 
grange multipliers. This method starts by defining an auxiliary function known 
as the Lagrangiarv. 

L = u(xi,X2) — \{pixi +P2X2 — m). 


The new variable A is called a Lagrange multiplier since it is multiplied by the 
constraint . 3 Then Lagrange’s theorem says that an optimal choice (*j, *2) must 
satisfy the three first-order conditions 


dL 

dxi 

8 L 

8x2 


/ * * \ 
OU[X 1 , X2) 

dxi 

/ * * \ 
OU\X 1 , X2) 

dx 2 


Api = 0 


Ap2 = 0 


dL 

dX 


= p ix\ + P2X2 — m = 0. 


There are several interesting things about these three equations. First, note 
that they are simply the derivatives of the Lagrangian with respect to xi, X2, 
and A, each set equal to zero. The last derivative, with respect to A, is just the 
budget constraint. Second, we now have three equations for the three unknowns, 
xi, X2, and A. We have a hope of solving for xi and X2 in terms of pi, P2, and 

TO. 

Lagrange’s theorem is proved in any advanced calculus book. It is used quite 
extensively in advanced economics courses, but for our purposes we only need to 
know the statement of the theorem and how to use it. 

In our particular case, it is worthwhile noting that if we divide the first condi¬ 
tion by the second one, we get 


du(xl,x 2 )/dxi _ pi 

du(x\,X2)/dx2 P2’ 


which simply says the MRS must equal the price ratio, just as before. The budget 
constraint gives us the other equation, so we are back to two equations in two 
unknowns. 


3 The Greek letter A is pronounced “lamb-da.' 
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EXAMPLE: Cobb-Douglas Demand Functions 

In Chapter 4 we introduced the Cobb-Douglas utility function 

u(x 1,12) = xlx%. 

Since utility functions are only defined up to a monotonic transformation, it is 
convenient to take logs of this expression and work with 

In u(xi, X2) = clnxi + dlnx2- 

Let’s find the demand functions for x\ and X2 for the Cobb-Douglas utility 
function. The problem we want to solve is 

max c In xi + d In X2 
X 1 > x 2 


such that pixi + P2X2 = m. 

There are at least three ways to solve this problem. One way is just to write 
down the MRS condition and the budget constraint. Using the expression for the 
MRS derived in Chapter 4 , we have 


cx 2 _ p 1 
dx 1 P2 
P1X1 +P2X2 = m. 


These are two equations in two unknowns that can be solved for the optimal 
choice of Xi and X2■ One way to solve them is to substitute the second into the 
first to get 

c(m/p 2 ~ X1P1/P2) _ Pi 
dx 1 P2 


Cross multiplying gives 


c(m — xipi) = dpixi. 


Rearranging this equation gives 


cm = (c + d)pixi 


o-I — - —. 

c + dpi 

This is the demand function for xi. To find the demand function for X2, substitute 
into the budget constraint to get 


m pi c m 

P2 P2 c + dpi 

d m 


c + dp2 
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The second way is to substitute the budget constraint into the maximization 
problem at the beginning. If we do this, our problem becomes 

maxclnii + dln(m/p2 — *iPi/P2)- 


The first-order condition for this problem is 


— - di¬ 


pt 


^= 0 . 


Xi m —pixi P2 
A little algebra—which you should do!—gives us the solution 

c m 

Xl 


c + dp i 

Substitute this back into the budget constraint X2 = rnlp2 — X1P1/P2 to get 

d m 


X 2 


c + dp 2 ' 


These are the demand functions for the two goods, which, happily, are the same 
as those derived earlier by the other method. 

Now for Lagrange’s method. Set up the Lagrangian 

L = clnxi + d\nx2 — X(pixi + P2X2 — m) 

and differentiate to get the three first-order conditions 

dLc 

=- Xpi = 0 

OX 1 Xl 

dL d , n 

a =- Xp2=0 

OX 2 X 2 

dL 

—- = pixi + P2X2 - mm 0 . 
oX 

Now the trick is to solve them! The best way to proceed is to first solve for A and 
then for xi and X2- So we rearrange and cross multiply the first two equations 
to get 

c = XpiXi 
d = Xp2X2- 

These equations are just asking to be added together: 


which gives us 


c + d= X(pixi + P2X2) = Am, 
A =c±A. 


Substitute this back into the first two equations and solve for x\ and X2 to get 

xi = 


X 2 = 


c m 
c + dpi 
d m 
c + dp 2 ’ 


just as before. 








CHAPTER 



DEMAND 


In the last chapter we presented the basic model of consumer choice: how 
maximizing utility subject to a budget constraint yields optimal choices. 
We saw that the optimal choices of the consumer depend on the consumer’s 
income and the prices of the goods, and we worked a few examples to see 
what the optimal choices are for some simple kinds of preferences. 

The consumer’s demand functions give the optimal amounts of each 
of the goods as a function of the prices and income faced by the consumer. 
We write the demand functions as 

x\ = xi(pi,p 2 ,m) 

x 2 = x 2 {pi 1 P2,m). 


The left-hand side of each equation stands for the quantity demanded. The 
right-hand side of each equation is the function that relates the prices and 
income to that quantity. 

In this chapter we will examine how the demand for a good changes as 
prices and income change. Studying how a choice responds to changes in the 
economic environment is known as comparative statics, which we first 
described in Chapter 1. “Comparative” means that we want to compare 
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two situations: before and after the change in the economic environment. 
“Statics” means that we are not concerned with any adjustment process 
that may be involved in moving from one choice to another; rather we will 
only examine the equilibrium choice. 

In the case of the consumer, there are only two things in our model 
that affect the optimal choice: prices and income. The comparative statics 
questions in consumer theory therefore involve investigating how demand 
changes when prices and income change. 


6.1 Normal and Inferior Goods 


We start by considering how a consumer’s demand for a good changes 
as his income changes. We want to know how the optimal choice at one 
income compares to the optimal choice at another level of income. During 
this exercise, we will hold the prices fixed and examine only the change in 
demand due to the income change. 

We know how an increase in money income affects the budget line when 
prices are fixed—it shifts it outward in a parallel fashion. So how does this 
affect demand? 

We would normally think that the demand for each good would increase 
when income increases, as shown in Figure 6.1. Economists, with a singular 
lack of imagination, call such goods normal goods. If good 1 is a normal 
good, then the demand for it increases when income increases, and de¬ 
creases when income decreases. For a normal good the quantity demanded 
always changes in the same way as income changes: 


Aaq 

Am 


> 0 . 


If something is called normal, you can be sure that there must be a 
possibility of being abnormal. And indeed there is. Figure 6.2 presents 
an example of nice, well-behaved indifference curves where an increase of 
income results in a reduction in the consumption of one of the goods. Such 
a good is called an inferior good. This may be “abnormal,” but when 
you think about it, inferior goods aren’t all that unusual. There are many 
goods for which demand decreases as income increases; examples might 
include gruel, bologna, shacks, or nearly any kind of low-quality good. 

Whether a good is inferior or not depends on the income level that we 
are examining. It might very well be that very poor people consume more 
bologna as their income increases. But after a point, the consumption of 
bologna would probably decline as income continued to increase. Since in 
real life the consumption of goods can increase or decrease when income 
increases, it is comforting to know that economic theory allows for both 
possibilities. 
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Normal goods. The demand for both goods increases when 
income increases, so both goods are normal goods. 


6.2 Income Offer Curves and Engel Curves 


We have seen that an increase in income corresponds to shifting the budget 
line outward in a parallel manner. We can connect together the demanded 
bundles that we get as we shift the budget line outward to construct the 
income offer curve. This curve illustrates the bundles of goods that are 
demanded at the different levels of income, as depicted in Figure 6.3A. 
The income offer curve is also known as the income expansion path. If 
both goods are normal goods, then the income expansion path will have a 
positive slope, as depicted in Figure 6.3A. 

For each level of income, m, there will be some optimal choice for each 
of the goods. Let us focus on good 1 and consider the optimal choice at 
each set of prices and income, Xi(pi,p 2 ,m). This is simply the demand 
function for good 1. If we hold the prices of goods 1 and 2 fixed and look 
at how demand changes as we change income, we generate a curve known 
as the Engel curve. The Engel curve is a graph of the demand for one of 
the goods as a function of income, with all prices being held constant. For 
an example of an Engel curve, see Figure 6.3B. 


Figure 

6.1 
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Figure An inferior good. Good 1 is an inferior good, which means 

6.2 that the demand for it decreases when income increases. 


Figure 

6.3 
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A Income offer curve 

B Engel curve 

How demand changes as income changes. The income of¬ 
fer curve (or income expansion path) shown in panel A depicts 
the optimal choice at different levels of income and constant 
prices. When we plot the optimal choice of good 1 against in¬ 
come, m, we get the Engel curve, depicted in panel B. 













SOME EXAMPLES 99 


6.3 Some Examples 

Let’s consider some of the preferences that we examined in Chapter 5 and 
see what their income offer curves and Engel curves look like. 


Perfect Substitutes 

The case of perfect substitutes is depicted in Figure 6.4. If p\ < p 2 , so 
that the consumer is specializing in consuming good 1, then if his income 
increases he will increase his consumption of good 1. Thus the income offer 
curve is the horizontal axis, as shown in Figure 6.4A. 



m 


Engel 

curve 


Slope = p, 


B Engel curve 


Perfect substitutes. The income offer curve (A) and an Engel 
curve (B) in the case of perfect substitutes. 


Figure 

6.4 


Since the demand for good 1 is X\ = m/pi in this case, the Engel curve 
will be a straight line with a slope of pi, as depicted in Figure 6.4B. (Since 
m, is on the vertical axis, and x\ on the horizontal axis, we can write 
m = piXi, which makes it clear that the slope is p\.) 

Perfect Complements 

The demand behavior for perfect complements is shown in Figure 6.5. Since 
the consumer will always consume the same amount of each good, no matter 
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what, the income offer curve is the diagonal line through the origin as 
depicted in Figure 6.5A. We have seen that the demand for good 1 is 
X\ = m/(pi + P2 ), so the Engel curve is a straight line with a slope of 
Pi + P 2 as shown in Figure 6.5B. 


Figure 

6.5 



m 


Engel 

curve 


Slope = P] + p 2 


B Engel curve 


Perfect complements. The income offer curve (A) and an 
Engel curve (B) in the case of perfect complements. 


Cobb-Douglas Preferences 

For the case of Cobb-Douglas preferences it is easier to look at the algebraic 
form of the demand functions to see what the graphs will look like. If 
u(x\,X 2 ) = x1x \~ a , the Cobb-Douglas demand for good 1 has the form 
x\ = am/pi . For a fixed value of pi, this is a linear function of m. Thus 
doubling m will double demand, tripling m will triple demand, and so on. 
In fact, multiplying m by any positive number t will just multiply demand 
by the same amount. 

The demand for good 2 is X 2 = (1 — a)m/p 2 , and this is also clearly linear. 
The fact that the demand functions for both goods are linear functions 
of income means that the income expansion paths will be straight lines 
through the origin, as depicted in Figure 6.6A. The Engel curve for good 1 
will be a straight line with a slope of pi/a , as depicted in Figure 6.6B. 
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A Income offer curve 

B Engel curve 

Cobb-Douglas. An income offer curve (A) and an Engel curve 
(B) for Cobb-Douglas utility. 


Figure 

6.6 


Homothetic Preferences 

All of the income offer curves and Engel curves that we have seen up to now 
have been straightforward -in fact they’ve been straight lines! This has 
happened because our examples have been so simple. Real Engel curves do 
not have to be straight lines. In general, when income goes up, the demand 
for a good could increase more or less rapidly than income increases. If the 
demand for a good goes up by a greater proportion than income, we say 
that it is a luxury good, and if it goes up by a lesser proportion than 
income we say that it is a necessary good. 

The dividing line is the case where the demand for a good goes up by 
the same proportion as income. This is what happened in the three cases 
we examined above. What aspect of the consumer’s preferences leads to 
this behavior? 

Suppose that the consumer’s preferences only depend on the ratio of 
good 1 to good 2. This means that if the consumer prefers (xi,x 2 ) to 
(j/ 1 , 2 / 2 )) then she automatically prefers (2a;!, 2 ^ 2 ) to (2yi,2y 2 ) 1 (3xi,3a:2) 
to (3y\,3y 2 ), and so on, since the ratio of good 1 to good 2 is the same for 
all of these bundles. In fact, the consumer prefers (tx\,tx 2 ) to (tyi,ty 2 ) for 
any positive value of t. Preferences that have this property are known as 
homothetic preferences. It is not hard to show that the three examples 
of preferences given above—perfect substitutes, perfect complements, and 
Cobb-Douglas—are all homothetic preferences. 
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If the consumer has homothetic preferences, then the income offer curves 
are all straight lines through the origin, as shown in Figure 6.7. More 
specifically, if preferences are homothetic, it means that when income is 
scaled up or down by any amount t > 0, the demanded bundle scales up 
or down by the same amount. This can be established rigorously, but it is 
fairly clear from looking at the picture. If the indifference curve is tangent 
to the budget line at (x*, x%), then the indifference curve through {tx\. tx* 2 ) 
is tangent to the budget line that has t times as much income and the same 
prices. This implies that the Engel curves are straight lines as well. If you 
double income, you just double the demand for each good. 


Figure 

6.7 
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A Income offer curve 

B Engel curve 

Homothetic preferences. An income offer curve (A) and an 

Engel 

curve (B) in the case of homothetic preferences. 


Homothetic preferences are very convenient since the income effects are 
so simple. Unfortunately, homothetic preferences aren’t very realistic for 
the same reason! But they will often be of use in our examples. 


Quasilinear Preferences 

Another kind of preferences that generates a special form of income offer 
curves and Engel curves is the case of quasilinear preferences. Recall the 
definition of quasilinear preferences given in Chapter 4. This is the case 
where all indifference curves are “shifted” versions of one indifference curve 
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as in Figure 6.8. Equivalently, the utility function for these preferences 
takes the form u(xi,x 2 ) = v{x\) + X 2 - What happens if we shift the budget 
line outward? In this case, if an indifference curve is tangent to the budget 
line at a bundle ( x\,X 2 ), then another indifference curve must also be 
tangent at ( x \, +k) for any constant k. Increasing income doesn’t change 

the demand for good 1 at all, and all the extra income goes entirely to the 
consumption of good 2. If preferences are quasilinear, we sometimes say 
that there is a “zero income effect” for good 1. Thus the Engel curve for 
good 1 is a vertical line—as you change income, the demand for good 1 
remains constant. 
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A Income offer curve 

B Engel curve 

Quasilinear preferences. An income offer curve (A) and an 

Enge' 

curve (B) with quasilinear preferences. 


Figure 

6.8 


What would be a real-life situation where this kind of thing might occur? 
Suppose good 1 is pencils and good 2 is money to spend on other goods. 
Initially I may spend my income only on pencils, but when my income 
gets large enough, I stop buying additional pencils—all of my extra income 
is spent on other goods. Other examples of this sort might be salt or 
toothpaste. When we are examining a choice between all other goods and 
some single good that isn’t a very large part of the consumer’s budget, the 
quasilinear assumption may well be plausible, at least when the consumer’s 
income is sufficiently large. 
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Figure 

6.9 


6.4 Ordinary Goods and Giffen Goods 

Let us now consider price changes. Suppose that we decrease the price of 
good 1 and hold the price of good 2 and money income fixed. Then what 
can happen to the quantity demanded of good 1? Intuition tells us that 
the quantity demanded of good 1 should increase when its price decreases. 
Indeed this is the ordinary case, as depicted in Figure 6.9. 



An ordinary good. Ordinarily, the demand for a good in¬ 
creases when its price decreases, as is the case here. 


When the price of good 1 decreases, the budget line becomes flatter. Or 
said another way, the vertical intercept is fixed and the horizontal intercept 
moves to the right. In Figure 6.9, the optimal choice of good 1 moves to 
the right as well: the quantity demanded of good 1 has increased. But we 
might wonder whether this always happens this way. Is it always the case 
that, no matter what kind of preferences the consumer has, the demand 
for a good must increase when its price goes down? 

As it turns out, the answer is no. It is logically possible to find well- 
behaved preferences for which a decrease in the price of good 1 leads to a 
reduction in the demand for good 1. Such a good is called a Giffen good, 
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for good 1 

A Giffen good. Good 1 is a Giffen good, since the demand 
for it decreases when its price decreases. 


Figure 

6.10 


after the nineteenth-century economist who first noted the possibility. An 
example is illustrated in Figure 6.10. 

What is going on here in economic terms? What kind of preferences 
might give rise to the peculiar behavior depicted in Figure 6.10? Suppose 
that the two goods that you are consuming are gruel and milk and that 
you are currently consuming 7 bowls of gruel and 7 cups of milk a week. 
Now the price of gruel declines. If you consume the same 7 bowls of gruel 
a week, you will have money left over with which you can purchase more 
milk. In fact, with the extra money you have saved because of the lower 
price of gruel, you may decide to consume even more milk and reduce your 
consumption of gruel. The reduction in the price of gruel has freed up some 
extra money to be spent on other things—but one thing you might want to 
do with it is reduce your consumption of gruel! Thus the price change is to 
some extent like an income change. Even though money income remains 
constant, a change in the price of a good will change purchasing power, 
and thereby change demand. 

So the Giffen good is not implausible purely on logical grounds, although 
Giffen goods are unlikely to be encountered in real-world behavior. Most 
goods are ordinary goods—when their price increases, the demand for them 
declines. We’ll see why this is the ordinary situation a little later. 
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Incidentally, it is no accident that we used gruel as an example of both 
an inferior good and a Giffen good. It turns out that there is an intimate 
relationship between the two which we will explore in a later chapter. 

But for now our exploration of consumer theory may leave you with 
the impression that nearly anything can happen: if income increases the 
demand for a good can go up or down, and if price increases the demand can 
go up or down. Is consumer theory compatible with any kind of behavior? 
Or are there some kinds of behavior that the economic model of consumer 
behavior rules out? It turns out that there are restrictions on behavior 
imposed by the maximizing model. But we’ll have to wait until the next 
chapter to see what they are. 


6.5 The Price Offer Curve and the Demand Curve 

Suppose that we let the price of good 1 change while we hold p 2 and income 
fixed. Geometrically this involves pivoting the budget line. We can think of 
connecting together the optimal points to construct the price offer curve 
as illustrated in Figure 6.11A. This curve represents the bundles that would 
be demanded at different prices for good 1. 


Figure 

6.11 
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The price offer curve and demand curve. Panel A contains 
a price offer curve, which depicts the optimal choices as the price 
of good 1 changes. Panel B contains the associated demand 
curve, which depicts a plot of the optimal choice of good 1 as a 
function of its price. 
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We can depict this same information in a different way. Again, hold 
the price of good 2 and money income fixed, and for each different value 
of pi plot the optimal level of consumption of good 1. The result is the 
demand curve depicted in Figure 6.1 IB. The demand curve is a plot 
of the demand function, Xi(pi,p 2 ,m), holding p 2 and m fixed at some 
predetermined values. 

Ordinarily, when the price of a good increases, the demand for that 
good will decrease. Thus the price and quantity of a good will move in 
opposite directions, which means that the demand curve will typically have 
a negative slope. In terms of rates of change, we would normally have 

Axi 


Api 


< 0 , 


which simply says that demand curves usually have a negative slope. 

However, we have also seen that in the case of Giffen goods, the demand 
for a good may decrease when its price decreases. Thus it is possible, but 
not likely, to have a demand curve with a positive slope. 


6.6 Some Examples 

Let’s look at a few examples of demand curves, using the preferences that 
we discussed in Chapter 3. 


Perfect Substitutes 

The offer curve and demand curve for perfect substitutes—the red and blue 
pencils example—are illustrated in Figure 6.12. As we saw in Chapter 5, 
the demand for good 1 is zero when p\ > P 2 , any amount on the budget 
line when pi = p 2 , and m/pi when p\ < p 2 . The offer curve traces out 
these possibilities. 

In order to find the demand curve, we fix the price of good 2 at some 
price P 2 and graph the demand for good 1 versus the price of good 1 to get 
the shape depicted in Figure 6.12B. 

Perfect Complements 

The case of perfect complements—the right and left shoes example—is 
depicted in Figure 6.13. We know that whatever the prices are, a consumer 
will demand the same amount of goods 1 and 2. Thus his offer curve will 
be a diagonal line as depicted in Figure 6.13A. 

We saw in Chapter 5 that the demand for good 1 is given by 

TO 

xi = -■-. 

Pi + P2 

If we fix to and p 2 and plot the relationship between x\ and p \, we get the 
curve depicted in Figure 6.13B. 
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Figure 

6.12 


Figure 

6.13 
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Perfect substitutes. Price offer curve (A) and demand curve 
(B) in the case of perfect substitutes. 




Perfect complements. Price offer curve (A) and demand 
curve (B) in the case of perfect complements. 


A Discrete Good 

Suppose that good 1 is a discrete good. If pi is very high then the consumer 
will strictly prefer to consume zero units; if pi is low enough the consumer 
will strictly prefer to consume one unit. At some price ri, the consumer will 
be indifferent between consuming good 1 or not consuming it. The price 
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at which the consumer is just indifferent to consuming or not consuming 
the good is called the reservation price . 1 The indifference curves and 
demand curve are depicted in Figure 6.14. 
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A discrete good. As the price of good 1 decreases there will 
be some price, the reservation price, at which the consumer is 
just indifferent between consuming good 1 or not consuming it. 

As the price decreases further, more units of the discrete good 

will be demanded. 



Figure 

6.14 


It is clear from the diagram that the demand behavior can be described 
by a sequence of reservation prices at which the consumer is just willing 
to purchase another unit of the good. At a price of ri the consumer is 
willing to buy 1 unit of the good; if the price falls to r 2 , he is willing to 
buy another unit, and so on. 

These prices can be described in terms of the original utility function. 
For example, rq is the price where the consumer is just indifferent between 
consuming 0 or 1 unit of good 1, so it must satisfy the equation 

u(0, to) = it(l, to — ri). (6-1) 

Similarly r 2 satisfies the equation 

zt(l, m — 7-2) = u(2, m — 2^). (6-2) 


1 The term reservation price comes from auction markets. When someone wanted to 
sell something in an auction he would typically state a minimum price at which he 
was willing to sell the good. If the best price offered was below this stated price, the 
seller reserved the right to purchase the item himself. This price became known as 
the seller’s reservation price and eventually came to be used to describe the price at 
which someone was just willing to buy or sell some item. 
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The left-hand side of this equation is the utility from consuming one unit of 
the good at a price of r 2 . The right-hand side is the utility from consuming 
two units of the good, each of which sells for r 2 . 

If the utility function is quasilinear, then the formulas describing the 
reservation prices become somewhat simpler. If u(xi,x 2 ) = v{x\) + x 2 , 
and 'c(O) = 0, then we can write equation (6.1) as 

u( 0 ) + m = m = u(l) + to — 7*1. 

Since u(0) = 0, we can solve for r± to find 


n = u(l). 


(6.3) 


Similarly, we can write equation (6.2) as 

v(l) + m — r 2 = v{2) + m — 2r 2 . 

Canceling terms and rearranging, this expression becomes 

r 2 = v(2) - u(l). 

Proceeding in this manner, the reservation price for the third unit of con¬ 
sumption is given by 

r 3 = v(3) - v(2) 

and so on. 

In each case, the reservation price measures the increment in utility nec¬ 
essary to induce the consumer to choose an additional unit of the good. 
Loosely speaking, the reservation prices measure the marginal utilities as¬ 
sociated with different levels of consumption of good 1. Our assumption 
of convex preferences implies that the sequence of reservation prices must 
decrease: rq > r 2 > r 3 • ■ 

Because of the special structure of the quasilinear utility function, the 
reservation prices do not depend on the amount of good 2 that the consumer 
has. This is certainly a special case, but it makes it very easy to describe 
demand behavior. Given any price p , we just find where it falls in the list 
of reservation prices. Suppose that p falls between r$ and vj, for example. 
The fact that r§> p means that the consumer is willing to give up p dollars 
per unit bought to get 6 units of good 1, and the fact that p > rq means 
that the consumer is not willing to give up p dollars per unit to get the 
seventh unit of good 1. 

This argument is quite intuitive, but let’s look at the math just to make 
sure that it is clear. Suppose that the consumer demands 6 units of good 1. 
We want to show that we must have re > p > r?. 

If the consumer is maximizing utility, then we must have 


u(6) + m — 6p > v{x\) + m — px i 
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for all possible choices of x\. In particular, we must have that 

u(6) + m — 6p > v(5) + m — 5 p. 

Rearranging this equation we have 

r 6 = i>(6) - v(5) > p , 

which is half of what we wanted to show. 

By the same logic, 

r>(6) + to — 6p > v(7) + m—7p. 
Rearranging this gives us 

p > v{7) — v(6) = r 7 , 

which is the other half of the inequality we wanted to establish. 


6.7 Substitutes and Complements 


We have already used the terms substitutes and complements, but it is now 
appropriate to give a formal definition. Since we have seen perfect substi¬ 
tutes and perfect complements several times already, it seems reasonable 
to look at the imperfect case. 

Let’s think about substitutes first. We said that red pencils and blue 
pencils might be thought of as perfect substitutes, at least for someone who 
didn’t care about color. But what about pencils and pens? This is a case 
of “imperfect” substitutes. That is, pens and pencils are, to some degree, 
a substitute for each other, although they aren’t as perfect a substitute for 
each other as red pencils and blue pencils. 

Similarly, we said that right shoes and left shoes were perfect comple¬ 
ments. But what about a pair of shoes and a pair of socks? Right shoes 
and left shoes are nearly always consumed together, and shoes and socks 
are usually consumed together. Complementary goods are those like shoes 
and socks that tend to be consumed together, albeit not always. 

Now that we’ve discussed the basic idea of complements and substitutes, 
we can give a precise economic definition. Recall that the demand function 
for good 1, say, will typically be a function of the price of both good 1 and 
good 2, so we write Xi(pi,p 2 ,rn). We can ask how the demand for good 1 
changes as the price of good 2 changes: does it go up or down? 

If the demand for good 1 goes up when the price of good 2 goes up, then 
we say that good 1 is a substitute for good 2. In terms of rates of change, 
good 1 is a substitute for good 2 if 


Ai’i 

Ap 2 


> 0 . 
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The idea is that when good 2 gets more expensive the consumer switches to 
consuming good 1: the consumer substitutes away from the more expensive 
good to the less expensive good. 

On the other hand, if the demand for good 1 goes down when the price 
of good 2 goes up, we say that good 1 is a complement to good 2. This 
means that 


Ai’i 

Ap 2 


< 0 . 


Complements are goods that are consumed together, like coffee and sugar, 
so when the price of one good rises, the consumption of both goods will 
tend to decrease. 

The cases of perfect substitutes and perfect complements illustrate these 
points nicely. Note that Axi / Ap 2 is positive (or zero) in the case of perfect 
substitutes, and that Axi/Ap 2 is negative in the case of perfect comple¬ 
ments. 

A couple of warnings are in order about these concepts. First, the two- 
good case is rather special when it comes to complements and substitutes. 
Since income is being held fixed, if you spend more money on good 1, you’ll 
have to spend less on good 2. This puts some restrictions on the kinds of 
behavior that are possible. When there are more than two goods, these 
restrictions are not so much of a problem. 

Second, although the definition of substitutes and complements in terms 
of consumer demand behavior seems sensible, there are some difficulties 
with the definitions in more general environments. For example, if we use 
the above definitions in a situation involving more than two goods, it is 
perfectly possible that good 1 may be a substitute for good 3, but good 3 
may be a complement for good 1. Because of this peculiar feature, more 
advanced treatments typically use a somewhat different definition of sub¬ 
stitutes and complements. The definitions given above describe concepts 
known as gross substitutes and gross complements; they will be suf¬ 
ficient for our needs. 


6.8 The Inverse Demand Function 

If we hold p 2 and m fixed and plot p\ against X\ we get the demand 
curve. As suggested above, we typically think that the demand curve 
slopes downwards, so that higher prices lead to less demand, although the 
Giffen example shows that it could be otherwise. 

As long as we do have a downward-sloping demand curve, as is usual, 
it is meaningful to speak of the inverse demand function. The inverse 
demand function is the demand function viewing price as a function of 
quantity. That is, for each level of demand for good 1, the inverse demand 
function measures what the price of good 1 would have to be in order for 
the consumer to choose that level of consumption. So the inverse demand 
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function measures the same relationship as the direct demand function, but 
just from another point of view. Figure 6.15 depicts the inverse demand 
function—or the direct demand function, depending on your point of view. 



Inverse demand curve. If you view the demand curve as 
measuring price as a function of quantity, you have an inverse 
demand function. 


Recall, for example, the Cobb-Douglas demand for good 1, x\ = am/pi. 
We could just as well write the relationship between price and quantity as 
Pi = am/x i- The first representation is the direct demand function; the 
second is the inverse demand function. 

The inverse demand function has a useful economic interpretation. Recall 
that as long as both goods are being consumed in positive amounts, the 
optimal choice must satisfy the condition that the absolute value of the 
MRS equals the price ratio: 

I MRS | = —. 

P2 

This says that at the optimal level of demand for good 1, for example, we 
must have 

P!=p 2 |MRS|. (6.4) 

Thus, at the optimal level of demand for good 1, the price of good 1 
is proportional to the absolute value of the MRS between good 1 and 
good 2. 


Figure 

6.15 
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Suppose for simplicity that the price of good 2 is one. Then equation 
(6.4) tells us that at the optimal level of demand, the price of good 1 
measures how much the consumer is willing to give up of good 2 in order 
to get a little more of good 1. In this case the inverse demand func¬ 
tion is simply measuring the absolute value of the MRS. For any opti¬ 
mal level of x\ the inverse demand function tells how much of good 2 
the consumer would want to have to compensate him for a small reduc¬ 
tion in the amount of good 1. Or, turning this around, the inverse de¬ 
mand function measures how much the consumer would be willing to sac¬ 
rifice of good 2 to make him just indifferent to having a little more of 
good 1. 

If we think of good 2 as being money to spend on other goods, then we 
can think of the MRS as being how many dollars the individual would be 
willing to give up to have a little more of good 1. We suggested earlier that 
in this case, we can think of the MRS as measuring the marginal willingness 
to pay. Since the price of good 1 is just the MRS in this case, this means 
that the price of good 1 itself is measuring the marginal willingness to 
pay. 

At each quantity aq, the inverse demand function measures how many 
dollars the consumer is willing to give up for a little more of good 1; or, 
said another way, how many dollars the consumer was willing to give up for 
the last unit purchased of good 1. For a small enough amount of good 1, 
they come down to the same thing. 

Looked at in this way, the downward-sloping demand curve has a new 
meaning. When aq is very small, the consumer is willing to give up a lot of 
money—that is, a lot of other goods, to acquire a little bit more of good 1. 
As x\ is larger, the consumer is willing to give up less money, on the margin, 
to acquire a little more of good 1. Thus the marginal willingness to pay, 
in the sense of the marginal willingness to sacrifice good 2 for good 1, is 
decreasing as we increase the consumption of good 1. 


Summary 


1. The consumer’s demand function for a good will in general depend on 
the prices of all goods and income. 

2. A normal good is one for which the demand increases when income 
increases. An inferior good is one for which the demand decreases when 
income increases. 

3. An ordinary good is one for which the demand decreases when its price 
increases. A Giffen good is one for which the demand increases when its 
price increases. 
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4. If the demand for good 1 increases when the price of good 2 increases, 
then good 1 is a substitute for good 2. If the demand for good 1 decreases 
in this situation, then it is a complement for good 2. 

5. The inverse demand function measures the price at which a given quan¬ 
tity will be demanded. The height of the demand curve at a given level 
of consumption measures the marginal willingness to pay for an additional 
unit of the good at that consumption level. 


REVIEW QUESTIONS 


1. If the consumer is consuming exactly two goods, and she is always spend¬ 
ing all of her money, can both of them be inferior goods? 

2. Show that perfect substitutes are an example of homothetic preferences. 

3. Show that Cobb-Douglas preferences are homothetic preferences. 

4. The income offer curve is to the Engel curve as the price offer curve is 

to ...? 

5. If the preferences are concave will the consumer ever consume both of 
the goods together? 

6. Are hamburgers and buns complements or substitutes? 

7. What is the form of the inverse demand function for good 1 in the case 
of perfect complements? 

8. True or false? If the demand function is x\ = —pi, then the inverse 
demand function is x = — 1 /p\. 


APPENDIX 

If preferences take a special form, this will mean that the demand functions that 
come from those preferences will take a special form. In Chapter 4 we described 
quasilinear preferences. These preferences involve indifference curves that are all 
parallel to one another and can be represented by a utility function of the form 

U{X 1 ,* 2 ) = V(xi) +X2- 

The maximization problem for a utility function like this is 


max v(xi) + X 2 
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s.t. pixi + P 2 X 2 = m. 

Solving the budget constraint for x 2 as a function of x\ and substituting into the 
objective function, we have 

maxr(ii) + m/p 2 — pixi/p 2 - 


Differentiating gives us the first-order condition 


v'(xl) 


pi 

P 2 ' 


This demand function has the interesting feature that the demand for good 1 
must be independent of income—just as we saw by using indifference curves. 
The inverse demand curve is given by 


Pi (ah) = v'(xi)p 2 . 


That is, the inverse demand function for good 1 is the derivative of the utility 
function times P 2 . Once we have the demand function for good 1, the demand 
function for good 2 comes from the budget constraint. 

For example, let us calculate the demand functions for the utility function 

u(xi,X 2 ) = hi ah + x 2 . 


Applying the first-order condition gives 

J_ _ Pi 
xi p 2 ’ 

so the direct demand function for good 1 is 


and the inverse demand function is 


Pi (ah) 


P2 

ah ’ 


The direct demand function for good 2 comes from substituting ah = P 2 /P 1 
into the budget constraint: 


m 

X2 = - 1 . 

P2 


A warning is in order concerning these demand functions. Note that the de¬ 
mand for good 1 is independent of income in this example. This is a general 
feature of a quasilinear utility function—the demand for good 1 remains con¬ 
stant as income changes. However, this can only be true for some values of 
income. A demand function can’t literally be independent of income for all val¬ 
ues of income; after all, when income is zero, all demands are zero. It turns 
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out that the quasilinear demand function derived above is only relevant when a 
positive amount of each good is being consumed. 

In this example, when m < P 2 , the optimal consumption of good 2 will be zero. 
As income increases the marginal utility of consumption of good 1 decreases. 
When m = p 2 , the marginal utility from spending additional income on good 
1 just equals the marginal utility from spending additional income on good 2. 
After that point, the consumer spends all additional income on good 2. 

So a better way to write the demand for good 2 is: 


when m < 

<2 — 1 when m > P 2 


J° 

l m/p- 


x 2 = 


For more on the properties of quasilinear demand functions see Hal R. Varian, 
Microeconomic Analysis, 3rd ed. (New York: Norton, 1992). 


CHAPTER 



REVEALED 

PREFERENCE 


In Chapter 6 we saw how we can use information about the consumer’s 
preferences and budget constraint to determine his or her demand. In 
this chapter we reverse this process and show how we can use informa¬ 
tion about the consumer’s demand to discover information about his or 
her preferences. Up until now, we were thinking about what preferences 
could tell us about people’s behavior. But in real life, preferences are 
not directly observable: we have to discover people’s preferences from 
observing their behavior. In this chapter we’ll develop some tools to do 
this. 

When we talk of determining people’s preferences from observing their 
behavior, we have to assume that the preferences will remain unchanged 
while we observe the behavior. Over very long time spans, this is not very 
reasonable. But for the monthly or quarterly time spans that economists 
usually deal with, it seems unlikely that a particular consumer’s tastes 
would change radically. Thus we will adopt a maintained hypothesis that 
the consumer’s preferences are stable over the time period for which we 
observe his or her choice behavior. 
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7.1 The Idea of Revealed Preference 

Before we begin this investigation, let’s adopt the convention that in this 
chapter, the underlying preferences—whatever they may be—are known 
to be strictly convex. Thus there will be a unique demanded bundle at 
each budget. This assumption is not necessary for the theory of revealed 
preference, but the exposition will be simpler with it. 

Consider Figure 7.1, where we have depicted a consumer’s demanded 
bundle, and another arbitrary bundle, (j/i,J/ 2 ), that is beneath 

the consumer’s budget line. Suppose that we are willing to postulate that 
this consumer is an optimizing consumer of the sort we have been study¬ 
ing. What can we say about the consumer’s preferences between these two 
bundles of goods? 



Revealed preference. The bundle (xi, £ 2 ) that the consumer 
chooses is revealed preferred to the bundle (yi, y 2 ), a bundle that 
he could have chosen. 


Well, the bundle (j/i, 2 / 2 ) is certainly an affordable purchase at the given 
budget—the consumer could have bought it if he or she wanted to, and 
would even have had money left over. Since (x\,X 2 ) is the optimal bundle, 
it must be better than anything else that the consumer could afford. Hence, 
in particular it must be better than (j/i, 2 / 2 ) • 

The same argument holds for any bundle on or underneath the budget 
line other than the demanded bundle. Since it could have been bought at 


Figure 

7.1 
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the given budget but wasn’t, then what was bought must be better. Here 
is where we use the assumption that there is a unique demanded bundle 
for each budget. If preferences are not strictly convex, so that indifference 
curves have flat spots, it may be that some bundles that are on the budget 
line might be just as good as the demanded bundle. This complication can 
be handled without too much difficulty, but it is easier to just assume it 
away. 

In Figure 7.1 all of the bundles in the shaded area underneath the budget 
line are revealed worse than the demanded bundle {x\, £ 2 ). This is because 
they could have been chosen, but were rejected in favor of (£ 1 , £ 2 ). We will 
now translate this geometric discussion of revealed preference into algebra. 

Let (xi, £ 2 ) be the bundle purchased at prices {pi,P 2 ) when the consumer 
has income m. What does it mean to say that (j/i, 2 / 2 ) is affordable at 
those prices and income? It simply means that (s/i, 2 / 2 ) satisfies the budget 
constraint 

PiVi + P2IJ2 < m. 

Since (£i,£ 2 ) is actually bought at the given budget, it must satisfy the 
budget constraint with equality 

Pi£i +7*22:2 = m. 

Putting these two equations together, the fact that (yi,y 2 ) is affordable at 
the budget (pi,p 2 ,rri) means that 

P1X1 + p 2 x 2 > piyi + P2V2- 

If the above inequality is satisfied and (3/1,2/2 ) is actually a different 
bundle from (£1, £2), we say that (£1,012) is directly revealed preferred 
to (2/1,3/2)- 

Note that the left-hand side of this inequality is the expenditure on the 
bundle that is actually chosen at prices [pi,P 2 )- Thus revealed preference is 
a relation that holds between the bundle that is actually demanded at some 
budget and the bundles that could have been demanded at that budget. 

The term “revealed preference” is actually a bit misleading. It does not 
inherently have anything to do with preferences, although we’ve seen above 
that if the consumer is making optimal choices, the two ideas are closely 
related. Instead of saying “X is revealed preferred to Y” it would be better 
to say “X is chosen over Y.” When we say that X is revealed preferred to 
Y, all we are claiming is that X is chosen when Y could have been chosen; 
that is, that piXi + p 2 x 2 > PiVi +P2V2- 


7.2 From Revealed Preference to Preference 

We can summarize the above section very simply. It follows from our model 
of consumer behavior -that people are choosing the best things they can 
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afford—that the choices they make are preferred to the choices that they 
could have made. Or, in the terminology of the last section, if ( 21 , 22 ) is 
directly revealed preferred to ( 2 / 1 , 3 / 2 ), then ( 21 , 22 ) is in fact preferred to 
( 3 / 1 , 2 / 2 )• Let us state this principle more formally: 

The Principle of Revealed Preference. Let ( x±,X2 ) be the chosen 
bundle when prices are (pi,P 2 ), and let ( 2/1 ? 2 / 2 ) be some other bundle such 
that p\X\ +P2X2 > Piyi T P2IJ2 • Then if the consumer is choosing the most 
preferred bundle she can afford, we must have ( 21 , 22 ) >~ (9 1 , 2 / 2 )- 

When you first encounter this principle, it may seem circular. If X is re¬ 
vealed preferred to Y, doesn’t that automatically mean that X is preferred 
to Y? The answer is no. “Revealed preferred” just means that X was cho¬ 
sen when Y was affordable; “preference” means that the consumer ranks 
X ahead of Y. If the consumer chooses the best bundles she can afford, 
then “revealed preference” implies “preference,” but that is a consequence 
of the model of behavior, not the definitions of the terms. 

This is why it would be better to say that one bundle is “chosen over” 
another, as suggested above. Then we would state the principle of revealed 
preference by saying: “If a bundle X is chosen over a bundle Y, then X 
must be preferred to Y.” In this statement it is clear how the model of 
behavior allows us to use observed choices to infer something about the 
underlying preferences. 

Whatever terminology you use, the essential point is clear: if we observe 
that one bundle is chosen when another one is affordable, then we have 
learned something about the preferences between the two bundles: namely, 
that the first is preferred to the second. 

Now suppose that we happen to know that ( 2 / 1 , 1 / 2 ) is a demanded bundle 
at prices ( 91 , 92 ) and that ( 91 , 92 ) is itself revealed preferred to some other 
bundle ( 21 , 22 ). That is, 


9i9i + 9292 > 9i z i + 92^2- 

Then we know that ( 21 , 22 ) >~ ( 91 , 92 ) and that ( 91 , 92 ) >~ ( 21 , 22 )- From 
the transitivity assumption we can conclude that ( 21 , 2 : 2 ) (£ 1 , 22 ). 

This argument is illustrated in Figure 7.2. Revealed preference and tran¬ 
sitivity tell us that ( 21 , 22 ) must be better than ( 21 , 22 ) for the consumer 
who made the illustrated choices. 

It is natural to say that in this case ( 21 , 22 ) is indirectly revealed 
preferred to ( 21 , 22 ). Of course the “chain” of observed choices may be 
longer than just three: if bundle A is directly revealed preferred to B, and 
B to C, and C to D, ... all the way to M, say, then bundle A is still 
indirectly revealed preferred to M. The chain of direct comparisons can be 
of any length. 

If a bundle is either directly or indirectly revealed preferred to another 
bundle, we will say that the first bundle is revealed preferred to the 
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Figure 

7.2 



Indirect revealed preference. The bundle (x\,X 2 ) is indi¬ 
rectly revealed preferred to the bundle ( 21 , 22 ). 


second. The idea of revealed preference is simple, but it is surprisingly 
powerful. Just looking at a consumer’s choices can give us a lot of infor¬ 
mation about the underlying preferences. Consider, for example, Figure 
7.2. Here we have several observations on demanded bundles at different 
budgets. We can conclude from these observations that since (aq,^) is 
revealed preferred, either directly or indirectly, to all of the bundles in the 
shaded area, (xi,X 2 ) is in fact preferred to those bundles by the consumer 
who made these choices. Another way to say this is to note that the true in¬ 
difference curve through (aq,^), whatever it is, must lie above the shaded 
region. 


7.3 Recovering Preferences 

By observing choices made by the consumer, we can learn about his or her 
preferences. As we observe more and more choices, we can get a better and 
better estimate of what the consumer’s preferences are like. 

Such information about preferences can be very important in making 
policy decisions. Most economic policy involves trading off some goods for 
others: if we put a tax on shoes and subsidize clothing, we’ll probably end 
up having more clothes and fewer shoes. In order to evaluate the desirabil¬ 
ity of such a policy, it is important to have some idea of what consumer 
preferences between clothes and shoes look like. By examining consumer 
choices, we can extract such information through the use of revealed pref¬ 
erence and related techniques. 
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If we are willing to add more assumptions about consumer preferences, 
we can get more precise estimates about the shape of indifference curves. 
For example, suppose we observe two bundles Y and Z that are revealed 
preferred to X, as in Figure 7.3, and that we are willing to postulate 
preferences are convex. Then we know that all of the weighted averages 
of Y and Z are preferred to X as well. If we are willing to assume that 
preferences are monotonic, then all the bundles that have more of both 
goods than X , Y, and Z —or any of their weighted averages—are also 
preferred to X. 



Trapping the indifference curve. The upper shaded area 
consists of bundles preferred to X, and the lower shaded area 
consists of bundles revealed worse than X. The indifference 
curve through X must lie somewhere in the region between the 
two shaded areas. 


The region labeled “Worse bundles” in Figure 7.3 consists of all the 
bundles to which X is revealed preferred. That is, this region consists of 
all the bundles that cost less than X, along with all the bundles that cost 
less than bundles that cost less than X, and so on. 


Figure 

7.3 
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Thus, in Figure 7 . 3 , we can conclude that all of the bundles in the upper 
shaded area are better than X , and that all of the bundles in the lower 
shaded area are worse than X , according to the preferences of the con¬ 
sumer who made the choices. The true indifference curve through X must 
lie somewhere between the two shaded sets. We’ve managed to trap the 
indifference curve quite tightly simply by an intelligent application of the 
idea of revealed preference and a few simple assumptions about preferences. 


7.4 The Weak Axiom of Revealed Preference 

All of the above relies on the assumption that the consumer has preferences 
and that she is always choosing the best bundle of goods she can afford. If 
the consumer is not behaving this way, the “estimates” of the indifference 
curves that we constructed above have no meaning. The question naturally 
arises: how can we tell if the consumer is following the maximizing model? 
Or, to turn it around: what kind of observation would lead us to conclude 
that the consumer was not maximizing? 

Consider the situation illustrated in Figure 7 . 4 . Could both of these 
choices be generated by a maximizing consumer? According to the logic 
of revealed preference, Figure 7.4 allows us to conclude two things: ( 1 ) 
(xi,X2) is preferred to (j/i, 2/2) ; and ( 2 ) (2/1, J/2) is preferred to ( X\,X2 )■ 
This is clearly absurd. In Figure 7.4 the consumer has apparently chosen 
(xi,X2) when she could have chosen (j/i, J/2), indicating that ( xi,X2 ) was 
preferred to (y 1, 2/2)1 but then she chose (2/1, 2/2) when she could have chosen 
( Xi,X2 )—indicating the opposite! 

Clearly, this consumer cannot be a maximizing consumer. Either the 
consumer is not choosing the best bundle she can afford, or there is some 
other aspect of the choice problem that has changed that we have not ob¬ 
served. Perhaps the consumer’s tastes or some other aspect of her economic 
environment have changed. In any event, a violation of this sort is not con¬ 
sistent with the model of consumer choice in an unchanged environment. 

The theory of consumer choice implies that such observations will not 
occur. If the consumers are choosing the best things they can afford, then 
things that are affordable, but not chosen, must be worse than what is 
chosen. Economists have formulated this simple point in the following 
basic axiom of consumer theory 

Weak Axiom of Revealed Preference (WARP), If (x\ , .7:2) is directly 
revealed preferred to (2/1,2/2), and the two bundles are not the same, then it 
cannot happen that (221,2/2) is directly revealed preferred to (xi,X2). 

In other words, if a bundle (xi,X2) is purchased at prices (j?i,j?2) and a 
different bundle (221,2/2) is purchased at prices (qi,q2), then if 


PlXi + P2X2 > PlVl + P 2 // 2 , 
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Violation of the Weak Axiom of Revealed Preference. 

A consumer who chooses both {x\,X 2 ) and ( 2 / 1 , 2 / 2 ) violates the 
Weak Axiom of Revealed Preference. 


it must not be the case that 

9l2/l + 922/2 > qiXl + 92^2 ■ 

In English: if the y-bundle is affordable when the x-bundle is purchased, 
then when the y-bundle is purchased, the x-bundle must not be affordable. 

The consumer in Figure 7.4 has violated WARP. Thus we know that this 
consumer’s behavior could not have been maximizing behavior. 1 

There is no set of indifference curves that could be drawn in Figure 7.4 
that could make both bundles maximizing bundles. On the other hand, 
the consumer in Figure 7.5 satisfies WARP. Here it is possible to find 
indifference curves for which his behavior is optimal behavior. One possible 
choice of indifference curves is illustrated. 


7.5 Checking WARP 

It is important to understand that WARP is a condition that must be sat¬ 
isfied by a consumer who is always choosing the best things he or she can 
afford. The Weak Axiom of Revealed Preference is a logical implication 


Figure 

7.4 


1 


Could we say his behavior is WARPed? Well, we could, but not in polite company. 
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Figure 

7.5 


Table 

7.1 



Satisfying WARP. Consumer choices that satisfy the Weak 
Axiom of Revealed Preference and some possible indifference 
curves. 


of that model and can therefore be used to check whether or not a partic¬ 
ular consumer, or an economic entity that we might want to model as a 
consumer, is consistent with our economic model. 

Let’s consider how we would go about systematically testing WARP in 
practice. Suppose that we observe several choices of bundles of goods at 
different prices. Let us use (pi,p|) to denote the t th observation of prices 
and {x\,x\) to denote the t th observation of choices. To use a specific 
example, let’s take the data in Table 7.1. 


Some consumption data. 


Observation 

Pi 

P2 

Xl 

X2 

1 

1 

2 

1 

2 

2 

2 

1 

2 

1 

3 

1 

1 

2 

2 


Given these data, we can compute how much it would cost the consumer 
to purchase each bundle of goods at each different set of prices, as we’ve 
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done in Table 7.2. For example, the entry in row 3, column 1, measures 
how much money the consumer would have to spend at the third set of 
prices to purchase the first bundle of goods. 


Cost of each bundle at each set of prices. 

Bundles 



1 

2 

3 

1 

5 

4* 

6 

Prices 2 

4* 

5 

6 

3 

3* 

3* 

4 


Table 

7.2 


The diagonal terms in Table 7.2 measure how much money the consumer 
is spending at each choice. The other entries in each row measure how much 
she would have spent if she had purchased a different bundle. Thus we can 
see whether bundle 3, say, is revealed preferred to bundle 1, by seeing if the 
entry in row 3, column 1 (how much the consumer would have to spend at 
the third set of prices to purchase the first bundle) is less than the entry in 
row 3, column 3 (how much the consumer actually spent at the third set 
of prices to purchase the third bundle). In this particular case, bundle 1 
was affordable when bundle 3 was purchased, which means that bundle 3 
is revealed preferred to bundle 1. Thus we put a star in row 3, column 1, 
of the table. 

From a mathematical point of view, we simply put a star in the entry in 
row s, column t, if the number in that entry is less than the number in row 
s, column s. 

We can use this table to check for violations of WARP. In this framework, 
a violation of WARP consists of two observations t and s such that row t , 
column s, contains a star and row s, column t , contains a star. For this 
would mean that the bundle purchased at s is revealed preferred to the 
bundle purchased at t and vice versa. 

We can use a computer (or a research assistant) to check and see whether 
there are any pairs of observations like these in the observed choices. If 
there are, the choices are inconsistent with the economic theory of the 
consumer. Either the theory is wrong for this particular consumer, or 
something else has changed in the consumer’s environment that we have 
not controlled for. Thus the Weak Axiom of Revealed Preference gives 
us an easily checkable condition for whether some observed choices are 
consistent with the economic theory of the consumer. 

In Table 7.2, we observe that row 1, column 2, contains a star and row 2, 
column 1, contains a star. This means that observation 2 could have been 
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chosen when the consumer actually chose observation 1 and vice versa. This 
is a violation of the Weak Axiom of Revealed Preference. We can conclude 
that the data depicted in Tables 7.1 and 7.2 could not be generated by a 
consumer with stable preferences who was always choosing the best things 
he or she could afford. 


7.6 The Strong Axiom of Revealed Preference 

The Weak Axiom of Revealed Preference described in the last section gives 
us an observable condition that must be satisfied by all optimizing con¬ 
sumers. But there is a stronger condition that is sometimes useful. 

We have already noted that if a bundle of goods X is revealed preferred 
to a bundle Y, and Y is in turn revealed preferred to a bundle Z , then X 
must in fact be preferred to Z. If the consumer has consistent preferences, 
then we should never observe a sequence of choices that would reveal that 
Z was preferred to X. 

The Weak Axiom of Revealed Preference requires that if X is directly 
revealed preferred to Y, then we should never observe Y being directly 
revealed preferred to X. The Strong Axiom of Revealed Preference 
(SARP) requires that the same sort of condition hold for indirect revealed 
preference. More formally, we have the following. 

Strong Axiom of Revealed Preference (SARP). If (x±,X 2 ) is re¬ 
vealed preferred to (j/i, z/ 2 ) (either directly or indirectly) and (j/i, Z/ 2 ) is dif¬ 
ferent from (xi,X 2 ), then (j/ 1 , 2 / 2 ) cannot be directly or indirectly revealed 
preferred to (x 1 , 3 : 2 ). 

It is clear that if the observed behavior is optimizing behavior then it 
must satisfy the SARP. For if the consumer is optimizing and (xi,X 2 ) 
is revealed preferred to (j/ 1 , 1 / 2 ), either directly or indirectly, then we must 
have (xi, X 2 ) >~ ( 2 / 1 , 2 / 2 )- So having (xi, X 2 ) revealed preferred to ( 2 / 1 , 2 / 2 ) an d 
(?7i, 2 / 2 ) revealed preferred to (xi,X 2 ) would imply that (xi,X 2 ) >~ ( 2 / 1 , 2 / 2 ) 
and ( 2 / 1 , 2 / 2 ) >- (xi, X 2 ), which is a contradiction. We can conclude that 
either the consumer must not be optimizing, or some other aspect of the 
consumer’s environment—such as tastes, other prices, and so on—must 
have changed. 

Roughly speaking, since the underlying preferences of the consumer must 
be transitive, it follows that the revealed preferences of the consumer must 
be transitive. Thus SARP is a necessary implication of optimizing behav¬ 
ior: if a consumer is always choosing the best things that he can afford, 
then his observed behavior must satisfy SARP. What is more surprising is 
that any behavior satisfying the Strong Axiom can be thought of as being 
generated by optimizing behavior in the following sense: if the observed 
choices satisfy SARP, we can always find nice, well-behaved preferences 
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that could have generated the observed choices. In this sense SARP is a 
sufficient condition for optimizing behavior: if the observed choices satisfy 
SARP, then it is always possible to find preferences for which the observed 
behavior is optimizing behavior. The proof of this claim is unfortunately 
beyond the scope of this book, but appreciation of its importance is not. 

What it means is that SARP gives us all of the restrictions on behavior 
imposed by the model of the optimizing consumer. For if the observed 
choices satisfy SARP, we can “construct” preferences that could have gen¬ 
erated these choices. Thus SARP is both a necessary and a sufficient 
condition for observed choices to be compatible with the economic model 
of consumer choice. 

Does this prove that the constructed preferences actually generated the 
observed choices? Of course not. As with any scientific statement, we can 
only show that observed behavior is not inconsistent with the statement. 
We can’t prove that the economic model is correct; we can just determine 
the implications of that model and see if observed choices are consistent 
with those implications. 


7.7 How to Check SARP 

Let us suppose that we have a table like Table 7.2 that has a star in row t, 
and column s if observation t is directly revealed preferred to observation 
s. How can we use this table to check SARP? 

The easiest way is first to transform the table. An example is given in 
Table 7.3. This is a table just like Table 7.2, but it uses a different set of 
numbers. Here the stars indicate direct revealed preference. The star in 
parentheses will be explained below. 


How to check SARP. 

Bundles 



1 

2 

3 

1 

20 

10 * 

22 W 

Prices 2 

21 

20 

15* 

3 

12 

15 

10 


Now we systematically look through the entries of the table and see 
if there are any chains of observations that make some bundle indirectly 
revealed preferred to that one. For example, bundle 1 is directly revealed 
preferred to bundle 2 since there is a star in row 1, column 2. And bundle 


Table 

7.3 
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2 is directly revealed preferred to bundle 3, since there is a star in row 2, 
column 3. Therefore bundle 1 is indirectly revealed preferred to bundle 3, 
and we indicate this by putting a star (in parentheses) in row 1, column 3. 

In general, if we have many observations, we will have to look for chains 
of arbitrary length to see if one observation is indirectly revealed preferred 
to another. Although it may not be exactly obvious how to do this, it 
turns out that there are simple computer programs that can calculate the 
indirect revealed preference relation from the table describing the direct 
revealed preference relation. The computer can put a star in location st 
of the table if observation s is revealed preferred to observation t by any 
chain of other observations. 

Once we have done this calculation, we can easily test for SARP. We just 
see if there is a situation where there is a star in row t, column s, and also a 
star in row s, column t. If so, we have found a situation where observation 
t is revealed preferred to observation s, either directly or indirectly, and, 
at the same time, observation s is revealed preferred to observation t. This 
is a violation of the Strong Axiom of Revealed Preference. 

On the other hand, if we do not find such violations, then we know that 
the observations we have are consistent with the economic theory of the 
consumer. These observations could have been made by an optimizing 
consumer with well-behaved preferences. Thus we have a completely op¬ 
erational test for whether or not a particular consumer is acting in a way 
consistent with economic theory. 

This is important, since we can model several kinds of economic units as 
behaving like consumers. Think, for example, of a household consisting of 
several people. Will its consumption choices maximize “household utility” ? 
If we have some data on household consumption choices, we can use the 
Strong Axiom of Revealed Preference to see. Another economic unit that 
we might think of as acting like a consumer is a nonprofit organization 
like a hospital or a university. Do universities maximize a utility func¬ 
tion in making their economic choices? If we have a list of the economic 
choices that a university makes when faced with different prices, we can, 
in principle, answer this kind of question. 


7.8 Index Numbers 

Suppose we examine the consumption bundles of a consumer at two differ¬ 
ent times and we want to compare how consumption has changed from one 
time to the other. Let b stand for the base period, and let t be some other 
time. How does “average” consumption in year t compare to consumption 
in the base period? 

Suppose that at time t prices are (p\,p 2 ) and that the consumer chooses 
(x\,x 2 ). In the base period &, the prices are (p\,p 2 ), and the consumer’s 
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choice is ( x b ,x 2 ). We want to ask how the “average” consumption of the 
consumer has changed. 

If we let u>\ and u >2 be some “weights” that go into making an average, 
then we can look at the following kind of quantity index: 

_ W\x\ + W2X\ 
q w\x\ + U> 2 x\ 

If I q is greater than 1, we can say that the “average” consumption has gone 
up in the movement from b to f; if I q is less than 1, we can say that the 
“average” consumption has gone down. 

The question is, what do we use for the weights? A natural choice is to 
use the prices of the goods in question, since they measure in some sense 
the relative importance of the two goods. But there are two sets of prices 
here: which should we use? 

If we use the base period prices for the weights, we have something called 
a Laspeyres index, and if we use the t period prices, we have something 
called a Paasche index. Both of these indices answer the question of what 
has happened to “average” consumption, but they just use different weights 
in the averaging process. 

Substituting the t period prices for the weights, we see that the Paasche 
quantity index is given by 

p{x\ +p t 2 x t 2 
q p\x\+p\x b 2 ' 

and substituting the b period prices shows that the Laspeyres quantity 
index is given by 

L = v\x\ + P b 2 xl 
q p\x\+p\x b 2 

It turns out that the magnitude of the Laspeyres and Paasche indices can 
tell us something quite interesting about the consumer’s welfare. Suppose 
that we have a situation where the Paasche quantity index is greater than 1: 


Pq = 


p\x\ + p\x t 2 
p\x\ + p\x b 2 


> 1 . 


What can we conclude about how well-off the consumer is at time t as 
compared to his situation at time bl 

The answer is provided by revealed preference. Just cross multiply this 
inequality to give 

p\x £ + p\x\ > p\x\ + p\x \, 

which immediately shows that the consumer must be better off at t than at 
6 , since he could have consumed the b consumption bundle in the t situation 
but chose not to do so. 
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What if the Paasche index is less than 1? Then we would have 


p\x\ + p\x\ < p\x\ + p\x b 2 , 


which says that when the consumer chose bundle (x\, xt 2 ), bundle (x b ,x 2 ) 
was not affordable. But that doesn’t say anything about the consumer’s 
ranking of the bundles. Just because something costs more than you can 
afford doesn’t mean that you prefer it to what you’re consuming now. 

What about the Laspeyres index? It works in a similar way. Suppose 
that the Laspeyres index is less than 1: 


L q 


p\x\ + p\x\ 
p\x\+p\x\ 


Cross multiplying yields 


p\x\ + p\x\ > p\x\ + p b 2 x\, 

which says that (x\, x 2 ) is revealed preferred to (x\,x t 2 ). Thus the consumer 
is better off at time b than at time t. 


7.9 Price Indices 

Price indices work in much the same way. In general, a price index will be 
a weighted average of prices: 

j _ p\w i + p\w 2 
P p\wi+p b 2 W2 

In this case it is natural to choose the quantities as the weights for com¬ 
puting the averages. We get two different indices, depending on our choice 
of weights. If we choose the t period quantities for weights, we get the 

Paasche price index: 


P P = 


p\x\ + pjxl 
p\x\ + p\x\' 


and if we choose the base period quantities we get the Laspeyres price 
index: 


Lp — 


P\x\+P2X2 

p\xX+p\x h 2 


Suppose that the Paasche price index is less than 1; what does revealed 
preference have to say about the welfare situation of the consumer in peri¬ 
ods t and 6? 
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Revealed preference doesn’t say anything at all. The problem is that 
there are now different prices in the numerator and in the denominator of 
the fractions defining the indices, so the revealed preference comparison 
can’t be made. 

Let’s define a new index of the change in total expenditure by 

M _ Pis! + ?44 

p\x\ +p\x b 2 

This is the ratio of total expenditure in period t to the total expenditure 
in period b. 

Now suppose that you are told that the Paasche price index was greater 
than M. This means that 

p = pWi + PW 2 p\A +p\A 

p p\x\+p b x\ p b x b +p b x b 2 

Canceling the numerators from each side of this expression and cross mul¬ 
tiplying, we have 

p\x\ +p 2 x 2 >p\x\ +p b 2 xl. 

This statement says that the bundle chosen at year b is revealed preferred 
to the bundle chosen at year t. This analysis implies that if the Paasche 
price index is greater than the expenditure index, then the consumer must 
be better off in year b than in year t. 

This is quite intuitive. After all, if prices rise by more than income rises 
in the movement from b to f, we would expect that would tend to make the 
consumer worse off. The revealed preference analysis given above confirms 
this intuition. 

A similar statement can be made for the Laspeyres price index. If the 
Laspeyres price index is less than M, then the consumer must be better off 
in year t than in year b. Again, this simply confirms the intuitive idea that 
if prices rise less than income, the consumer would become better off. In 
the case of price indices, what matters is not whether the index is greater 
or less than 1, but whether it is greater or less than the expenditure index. 


EXAMPLE: Indexing Social Security Payments 

Many elderly people have Social Security payments as their sole source 
of income. Because of this, there have been attempts to adjust Social 
Security payments in a way that will keep purchasing power constant even 
when prices change. Since the amount of payments will then depend on the 
movement of some price index or cost-of-living index, this kind of scheme 
is referred to as indexing. 
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Figure 

7.6 


One indexing proposal goes as follows. In some base year b, econo¬ 
mists measure the average consumption bundle of senior citizens. In each 
subsequent year the Social Security system adjusts payments so that the 
“purchasing power” of the average senior citizen remains constant in the 
sense that the average Social Security recipient is just able to afford the 
consumption bundle available in year b , as depicted in Figure 7.6. 



Social Security. Changing prices will typically make the con¬ 
sumer better off than in the base year. 


One curious result of this indexing scheme is that the average senior 
citizen will almost always be better off than he or she was in the base year 
b. Suppose that year b is chosen as the base year for the price index. Then 
the bundle (x\, x%) is the optimal bundle at the prices (p^P;!)- This means 
that the budget line at prices must be tangent to the indifference 

curve through 

Now suppose that prices change. To be specific, suppose that prices 
increase so that the budget line, in the absence of Social Security, would 
shift inward and tilt. The inward shift is due to the increase in prices; the 
tilt is due to the change in relative prices. The indexing program would 
then increase the Social Security payment so as to make the original bundle 
(x\,x\) affordable at the new prices. But this means that the budget line 
would cut the indifference curve, and there would be some other bundle 
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on the budget line that would be strictly preferred to (xi,x|). Thus the 
consumer would typically be able to choose a better bundle than he or she 
chose in the base year. 

Summary 

1. If one bundle is chosen when another could have been chosen, we say 
that the first bundle is revealed preferred to the second. 

2. If the consumer is always choosing the most preferred bundles he or she 
can afford, this means that the chosen bundles must be preferred to the 
bundles that were affordable but weren’t chosen. 

3. Observing the choices of consumers can allow us to “recover” or esti¬ 
mate the preferences that lie behind those choices. The more choices we 
observe, the more precisely we can estimate the underlying preferences that 
generated those choices. 

4. The Weak Axiom of Revealed Preference (WARP) and the Strong Ax¬ 
iom of Revealed Preference (SARP) are necessary conditions that consumer 
choices have to obey if they are to be consistent with the economic model 
of optimizing choice. 


REVIEW QUESTIONS 


1. When prices are (pi,P 2 ) = (1,2) a consumer demands (xi,X 2 ) = (1,2), 
and when prices are (q±, < 72 ) = ( 2 , 1 ) the consumer demands (j/i, J/ 2 ) = ( 2 , 1 ). 
Is this behavior consistent with the model of maximizing behavior? 

2. When prices are (pi,P 2 ) = (2,1) a consumer demands (xi,X 2 ) = (1,2), 
and when prices are (< 71 , < 72 ) = (1, 2 ) the consumer demands (j/i, J/ 2 ) = ( 2 ,1). 
Is this behavior consistent with the model of maximizing behavior? 

3. In the preceding exercise, which bundle is preferred by the consumer, 
the x-bundle or the y-bundle? 

4. We saw that the Social Security adjustment for changing prices would 
typically make recipients at least as well-off as they were at the base year. 
What kind of price changes would leave them just as well-off, no matter 
what kind of preferences they had? 

5. In the same framework as the above question, what kind of preferences 
would leave the consumer just as well-off as he was in the base year, for all 
price changes? 


CHAPTER 



SLUTSKY 

EQUATION 


Economists often are concerned with how a consumer’s behavior changes 
in response to changes in the economic environment. The case we want 
to consider in this chapter is how a consumer’s choice of a good responds 
to changes in its price. It is natural to think that when the price of a 
good rises the demand for it will fall. However, as we saw in Chapter 6 
it is possible to construct examples where the optimal demand for a good 
decreases when its price falls. A good that has this property is called a 
GifFen good. 

Giffen goods are pretty peculiar and are primarily a theoretical curiosity, 
but there are other situations where changes in prices might have “perverse” 
effects that, on reflection, turn out not to be so unreasonable. For example, 
we normally think that if people get a higher wage they will work more. 
But what if your wage went from $10 an hour to $1000 an hour? Would 
you really work more? Might you not decide to work fewer hours and use 
some of the money you’ve earned to do other things? What if your wage 
were $1,000,000 an hour? Wouldn’t you work less? 

For another example, think of what happens to your demand for apples 
when the price goes up. You would probably consume fewer apples. But 
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how about a family who grew apples to sell? If the price of apples went 
up, their income might go up so much that they would feel that they could 
now afford to consume more of their own apples. For the consumers in this 
family, an increase in the price of apples might well lead to an increase in 
the consumption of apples. 

What is going on here? How is it that changes in price can have these 
ambiguous effects on demand? In this chapter and the next we’ll try to 
sort out these effects. 


8.1 The Substitution Effect 

When the price of a good changes, there are two sorts of effects: the rate 
at which you can exchange one good for another changes, and the total 
purchasing power of your income is altered. If, for example, good 1 becomes 
cheaper, it means that you have to give up less of good 2 to purchase good 
1. The change in the price of good 1 has changed the rate at which the 
market allows you to “substitute” good 2 for good 1. The trade-off between 
the two goods that the market presents the consumer has changed. 

At the same time, if good 1 becomes cheaper it means that your money 
income will buy more of good 1. The purchasing power of your money has 
gone up; although the number of dollars you have is the same, the amount 
that they will buy has increased. 

The first part—the change in demand due to the change in the rate 
of exchange between the two goods—is called the substitution effect. 
The second effect—the change in demand due to having more purchasing 
power —is called the income effect. These are only rough definitions of the 
two effects. In order to give a more precise definition we have to consider 
the two effects in greater detail. 

The way that we will do this is to break the price movement into two 
steps: first we will let the relative prices change and adjust money income 
so as to hold purchasing power constant, then we will let purchasing power 
adjust while holding the relative prices constant. 

This is best explained by referring to Figure 8.1. Here we have a situa¬ 
tion where the price of good 1 has declined. This means that the budget 
line rotates around the vertical intercept m/p 2 and becomes flatter. We 
can break this movement of the budget line up into two steps: first pivot 
the budget line around the original demanded bundle and then shift the 
pivoted line out to the new demanded bundle. 

This “pivot-shift” operation gives us a convenient way to decompose 
the change in demand into two pieces. The first step—the pivot—is a 
movement where the slope of the budget line changes while its purchasing 
power stays constant, while the second step is a movement where the slope 
stays constant and the purchasing power changes. This decomposition is 
only a hypothetical construction—the consumer simply observes a change 
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Figure 

8.1 



Pivot and shift. When the price of good 1 changes and income 
stays fixed, the budget line pivots around the vertical axis. We 
will view this adjustment as occurring in two stages: first pivot 
the budget line around the original choice, and then shift this 
line outward to the new demanded bundle. 


in price and chooses a new bundle of goods in response. But in analyzing 
how the consumer’s choice changes, it is useful to think of the budget line 
changing in two stages—first the pivot, then the shift. 

What are the economic meanings of the pivoted and the shifted budget 
lines? Let us first consider the pivoted line. Here we have a budget line with 
the same slope and thus the same relative prices as the final budget line. 
However, the money income associated with this budget line is different, 
since the vertical intercept is different. Since the original consumption 
bundle (ii,X 2 ) lies on the pivoted budget line, that consumption bundle 
is just affordable. The purchasing power of the consumer has remained 
constant in the sense that the original bundle of goods is just affordable at 
the new pivoted line. 

Let us calculate how much we have to adjust money income in order to 
keep the old bundle just affordable. Let m' be the amount of money income 
that will just make the original consumption bundle affordable; this will 
be the amount of money income associated with the pivoted budget line. 
Since (xi,X 2 ) is affordable at both (pi,p 2 ,m) and (p' l ,p 2 ,m'), we have 

m! = p[x i + p 2 X 2 
m = p\Xi + P2X2- 

Subtracting the second equation from the first gives 

m! — m = x 1 [pi — pi]. 
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This equation says that the change in money income necessary to make 
the old bundle affordable at the new prices is just the original amount of 
consumption of good 1 times the change in prices. 

Letting Api = p' x — pi represent the change in price 1, and Am = 
m! — m represent the change in income necessary to make the old bundle 
just affordable, we have 

Am = xiApi. (8-1) 

Note that the change in income and the change in price will always move 
in the same direction: if the price goes up, then we have to raise income to 
keep the same bundle affordable. 

Let’s use some actual numbers. Suppose that the consumer is originally 
consuming 20 candy bars a week, and that candy bars cost 50 cents a piece. 
If the price of candy bars goes up by 10 cents—so that Api = .60 — .50 = 
.10 -how much would income have to change to make the old consumption 
bundle affordable? 

We can apply the formula given above. If the consumer had $2.00 more 
income, he would just be able to consume the same number of candy bars, 
namely, 20. In terms of the formula: 

Am = Api x xi = .10 x 20 = $2.00. 

Now we have a formula for the pivoted budget line: it is just the budget 
line at the new price with income changed by Am. Note that if the price of 
good 1 goes down, then the adjustment in income will be negative. When 
a price goes down, a consumer’s purchasing power goes up, so we will have 
to decrease the consumer’s income in order to keep purchasing power fixed. 
Similarly, when a price goes up, purchasing power goes down, so the change 
in income necessary to keep purchasing power constant must be positive. 

Although (xi,X 2 ) is still affordable, it is not generally the optimal pur¬ 
chase at the pivoted budget line. In Figure 8.2 we have denoted the optimal 
purchase on the pivoted budget line by Y. This bundle of goods is the op¬ 
timal bundle of goods when we change the price and then adjust dollar 
income so as to keep the old bundle of goods just affordable. The move¬ 
ment from X to Y is known as the substitution effect. It indicates how 
the consumer “substitutes” one good for the other when a price changes 
but purchasing power remains constant. 

More precisely, the substitution effect, Ax®, is the change in the demand 
for good 1 when the price of good 1 changes to p[ and, at the same time, 
money income changes to mb 

AiJ = x\(j! x , ml) - x-i (pi, m). 

In order to determine the substitution effect, we must use the consumer’s 
demand function to calculate the optimal choices at (pi,m') and (pi,m). 
The change in the demand for good 1 may be large or small, depending 
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Figure 

8.2 



Substitution Income 

effect effect 


Substitution effect and income effect. The pivot gives the 
substitution effect, and the shift gives the income effect. 


on the shape of the consumer’s indifference curves. But given the demand 
function, it is easy to just plug in the numbers to calculate the substitution 
effect. (Of course the demand for good 1 may well depend on the price of 
good 2; but the price of good 2 is being held constant during this exercise, 
so we’ve left it out of the demand function so as not to clutter the notation.) 

The substitution effect is sometimes called the change in compensated 
demand. The idea is that the consumer is being compensated for a price 
rise by having enough income given back to him to purchase his old bun¬ 
dle. Of course if the price goes down he is “compensated” by having money 
taken away from him. We’ll generally stick with the “substitution” termi¬ 
nology, for consistency, but the “compensation” terminology is also widely 
used. 


EXAMPLE: Calculating the Substitution Effect 

Suppose that the consumer has a demand function for milk of the form 


x\ = 10 + 


10 pi 


Originally his income is $120 per week and the price of milk is $3 per quart. 
Thus his demand for milk will be 10 + 120/(10 x 3) = 14 quarts per week. 
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Now suppose that the price of milk falls to $2 per quart. Then his 
demand at this new price will be 10 +120/(10 x 2) = 16 quarts of milk per 
week. The total change in demand is +2 quarts a week. 

In order to calculate the substitution effect, we must first calculate how 
much income would have to change in order to make the original consump¬ 
tion of milk just affordable when the price of milk is $2 a quart. We apply 
the formula (8.1): 

Am = xiApi = 14 x (2 — 3) = —$14. 

Thus the level of income necessary to keep purchasing power constant 
is m' = m+ Am = 120 — 14 = 106. What is the consumer’s demand for 
milk at the new price, $2 per quart, and this level of income? Just plug 
the numbers into the demand function to find 

. . , 106 
xi(p 1 ,m) = x±(2, 106) = 10 + ^ 2 = 15.3. 

Thus the substitution effect is 

Ax? = xi(2,106) - xi(3,120) = 15.3 - 14 = 1.3. 


8.2 The Income Effect 

We turn now to the second stage of the price adjustment—the shift move¬ 
ment. This is also easy to interpret economically. We know that a parallel 
shift of the budget line is the movement that occurs when income changes 
while relative prices remain constant. Thus the second stage of the price 
adjustment is called the income effect. We simply change the consumer’s 
income from m! to m, keeping the prices constant at (p\ , p 2 ) ■ In Figure 
8.2 this change moves us from the point (j/i, 2 / 2 ) to ( 21 , 22 )- It is natural to 
call this last movement the income effect since all we are doing is changing 
income while keeping the prices fixed at the new prices. 

More precisely, the income effect, Ax", is the change in the demand for 
good 1 when we change income from m' to ro, holding the price of good 1 
fixed at p\: 

Ax" = xi(pi, m) - Xi(p' 1 ,m'). 

We have already considered the income effect earlier in section 6.1. There 
we saw that the income effect can operate either way: it will tend to increase 
or decrease the demand for good 1 depending on whether we have a normal 
good or an inferior good. 

When the price of a good decreases, we need to decrease income in order 
to keep purchasing power constant. If the good is a normal good, then 
this decrease in income will lead to a decrease in demand. If the good is 
an inferior good, then the decrease in income will lead to an increase in 
demand. 
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EXAMPLE: Calculating the Income Effect 

In the example given earlier in this chapter we saw that 

x\ (p'i , m) = xi (2,120) = 16 
Xi(p[, m!) = xi(2, 106) = 15.3. 

Thus the income effect for this problem is 

Ax" = xi(2,120) - xi(2,106) = 16 - 15.3 = 0.7. 

Since milk is a normal good for this consumer, the demand for milk in¬ 
creases when income increases. 


8.3 Sign of the Substitution Effect 


We have seen above that the income effect can be positive or negative, de¬ 
pending on whether the good is a normal good or an inferior good. What 
about the substitution effect? If the price of a good goes down, as in 
Figure 8.2, then the change in the demand for the good due to the substi¬ 
tution effect must be nonnegative. That is, if p± > p\ , then we must have 
x\(pi,m r ) > xi(pi,m), so that Axf > 0. 

The proof of this goes as follows. Consider the points on the pivoted 
budget line in Figure 8.2 where the amount of good 1 consumed is less 
than at the bundle X. These bundles were all affordable at the old prices 
(pi, P 2 ) but they weren’t purchased. Instead the bundle X was purchased. 
If the consumer is always choosing the best bundle he can afford, then X 
must be preferred to all of the bundles on the part of the pivoted line that 
lies inside the original budget set. 

This means that the optimal choice on the pivoted budget line must not 
be one of the bundles that lies underneath the original budget line. The 
optimal choice on the pivoted line would have to be either X or some point 
to the right of X. But this means that the new optimal choice must involve 
consuming at least as much of good 1 as originally, just as we wanted to 
show. In the case illustrated in Figure 8.2, the optimal choice at the pivoted 
budget line is the bundle Y, which certainly involves consuming more of 
good 1 than at the original consumption point, X. 

The substitution effect always moves opposite to the price movement. 
We say that the substitution effect is negative, since the change in demand 
due to the substitution effect is opposite to the change in price: if the price 
increases, the demand for the good due to the substitution effect decreases. 
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8.4 The Total Change in Demand 

The total change in demand, Axi, is the change in demand due to the 
change in price, holding income constant: 

Axi = xi Oi, to) - x\ (pi, m). 

We have seen above how this change can be broken up into two changes: the 
substitution effect and the income effect. In terms of the symbols defined 
above, 


Ax\ = Ax® + Ax” 

xi (pi, to) - xi (pi , to) = [xi (p[, rri) - x x (pi , to)] 

+ [xi(Pi,m) - Xi(p[,m 1 )}. 

In words this equation says that the total change in demand equals the 
substitution effect plus the income effect. This equation is called the Slut¬ 
sky identity . 1 Note that it is an identity: it is true for all values of pi, 
Pi, to , and m'. The first and fourth terms on the right-hand side cancel 
out, so the right-hand side is identically equal to the left-hand side. 

The content of the Slutsky identity is not just the algebraic identity— 
that is a mathematical triviality. The content comes in the interpretation 
of the two terms on the right-hand side: the substitution effect and the 
income effect. In particular, we can use what we know about the signs of 
the income and substitution effects to determine the sign of the total effect. 

While the substitution effect must always be negative—opposite the 
change in the price—the income effect can go either way. Thus the to¬ 
tal effect may be positive or negative. However, if we have a normal good, 
then the substitution effect and the income effect work in the same direc¬ 
tion. An increase in price means that demand will go down due to the 
substitution effect. If the price goes up, it is like a decrease in income, 
which, for a normal good, means a decrease in demand. Both effects rein¬ 
force each other. In terms of our notation, the change in demand due to a 
price increase for a normal good means that 

Axi = Axf + Ax". 

(-) (-) (-) 

(The minus signs beneath each term indicate that each term in this expres¬ 
sion is negative.) 


1 Named for Eugen Slutsky (1880-1948), a Russian economist who investigated demand 
theory. 
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Note carefully the sign on the income effect. Since we are considering 
a situation where the price rises, this implies a decrease in purchasing 
power—for a normal good this will imply a decrease in demand. 

On the other hand, if we have an inferior good, it might happen that the 
income effect outweighs the substitution effect, so that the total change in 
demand associated with a price increase is actually positive. This would 
be a case where 

Axi = Axf + Ax". 

(?) (-) (+) 

If the second term on the right-hand side—the income effect—is large 
enough, the total change in demand could be positive. This would mean 
that an increase in price could result in an increase in demand. This is the 
perverse Giffen case described earlier: the increase in price has reduced the 
consumer’s purchasing power so much that he has increased his consump¬ 
tion of the inferior good. 

But the Slutsky identity shows that this kind of perverse effect can only 
occur for inferior goods: if a good is a normal good, then the income and 
substitution effects reinforce each other, so that the total change in demand 
is always in the “right” direction. 

Thus a Giffen good must be an inferior good. But an inferior good is 
not necessarily a Giffen good: the income effect not only has to be of the 
“wrong” sign, it also has to be large enough to outweigh the “right” sign 
of the substitution effect. This is why Giffen goods are so rarely observed 
in real life: they would not only have to be inferior goods, but they would 
have to be very inferior. 

This is illustrated graphically in Figure 8.3. Here we illustrate the usual 
pivot-shift operation to find the substitution effect and the income effect. 
In both cases, good 1 is an inferior good, and the income effect is therefore 
negative. In Figure 8.3A, the income effect is large enough to outweigh 
the substitution effect and produce a Giffen good. In Figure 8.3B, the 
income effect is smaller, and thus good 1 responds in the ordinary way to 
the change in its price. 


8.5 Rates of Change 

We have seen that the income and substitution effects can be described 
graphically as a combination of pivots and shifts, or they can be described 
algebraically in the Slutsky identity 

Axi = AxJ + Ax", 

which simply says that the total change in demand is the substitution 
effect plus the income effect. The Slutsky identity here is stated in terms 
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B Non-Giffen inferior good 


Inferior goods. Panel A shows a good that is inferior enough 
to cause the Giffen case. Panel B shows a good that is inferior, 
but the effect is not strong enough to create a Giffen good. 


Figure 

8.3 


of absolute changes, but it is more common to express it in terms of rates 
of change. 

When we express the Slutsky identity in terms of rates of change it turns 
out to be convenient to define Ax™ to be the negative of the income effect: 

Ax™ = xi(pi,m') — x\ = —Ax". 

Given this definition, the Slutsky identity becomes 

Axi = Ax\ — Ax™. 

If we divide each side of the identity by A pi, we have 

A %i = Axf _ Ax™ , g 2 > 

Apr Api Api ‘ 

The first term on the right-hand side is the rate of change of demand 
when price changes and income is adjusted so as to keep the old bundle 
affordable—the substitution effect. Let’s work on the second term. Since 
we have an income change in the numerator, it would be nice to get an 
income change in the denominator. 
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Remember that the income change, Am, and the price change, Api, are 
related by the formula 

Am = x±Api. 


Solving for Api we find 


A Am 

Api = -. 

Xi 


Now substitute this expression into the last term in (8.2) to get our final 
formula: 

Axi Axf Ax™ 


A pi A pi Am 


X\. 


This is the Slutsky identity in terms of rates of change. We can interpret 
each term as follows: 


Axi _ 3?i (p[,m) - Xi(pi,m) 

Api Api 

is the rate of change in demand as price changes, holding income fixed; 

Axf _ Xi(p' 11 m') - xi(j>i,m) 

Api Api 


is the rate of change in demand as the price changes, adjusting income so 
as to keep the old bundle just affordable, that is, the substitution effect; 
and 


Ax 7 1 X! (p[ ,m')-x 1 (pi, m) 

-r- Xi = - 7 - xx 

i\m m — m 


(8.3) 


is the rate of change of demand holding prices fixed and adjusting income, 
that is, the income effect. 

The income effect is itself composed of two pieces: how demand changes 
as income changes, times the original level of demand. When the price 
changes by Api, the change in demand due to the income effect is 


Ax™ = 


Xi(pi,n r i') ~ Xi(pi,m) 

Am 


xiApi. 


But this last term, XiApi, is just the change in income necessary to keep 
the old bundle feasible. That is, XiApi = Am, so the change in demand 
due to the income effect reduces to 


Ax™ 


xi(p[,m') - x 1 (p' 1 ,m) 
Am 


Am, 


just as we had before. 
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8.6 The Law of Demand 

In Chapter 5 we voiced some concerns over the fact that consumer theory 
seemed to have no particular content: demand could go up or down when a 
price increased, and demand could go up or down when income increased. 
If a theory doesn’t restrict observed behavior in some fashion it isn’t much 
of a theory. A model that is consistent with all behavior has no real content. 

However, we know that consumer theory does have some content—we’ve 
seen that choices generated by an optimizing consumer must satisfy the 
Strong Axiom of Revealed Preference. Furthermore, we’ve seen that any 
price change can be decomposed into two changes: a substitution effect 
that is sure to be negative—opposite the direction of the price change— 
and an income effect whose sign depends on whether the good is a normal 
good or an inferior good. 

Although consumer theory doesn’t restrict how demand changes when 
price changes or how demand changes when income changes, it does re¬ 
strict how these two kinds of changes interact. In particular, we have the 
following. 

The Law of Demand. If the demand for a good increases when income 
increases, then the demand for that good must decrease when its price in¬ 
creases. 

This follows directly from the Slutsky equation: if the demand increases 
when income increases, we have a normal good. And if we have a normal 
good, then the substitution effect and the income effect reinforce each other, 
and an increase in price will unambiguously reduce demand. 


8.7 Examples of Income and Substitution Effects 

Let’s now consider some examples of price changes for particular kinds of 
preferences and decompose the demand changes into the income and the 
substitution effects. 

We start with the case of perfect complements. The Slutsky decomposi¬ 
tion is illustrated in Figure 8.4. When we pivot the budget line around the 
chosen point, the optimal choice at the new budget line is the same as at 
the old one—this means that the substitution effect is zero. The change in 
demand is due entirely to the income effect. 

What about the case of perfect substitutes, illustrated in Figure 8.5? 
Here when we tilt the budget line, the demand bundle jumps from the 
vertical axis to the horizontal axis. There is no shifting left to do! The 
entire change in demand is due to the substitution effect. 
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Figure 

8.4 



Perfect complements. Slutsky decomposition with perfect 
complements. 


As a third example, let us consider the case of quasilinear preferences. 
This situation is somewhat peculiar. We have already seen that a shift 
in income causes no change in demand for good 1 when preferences are 
quasilinear. This means that the entire change in demand for good 1 is due 
to the substitution effect, and that the income effect is zero, as illustrated 
in Figure 8.6. 


EXAMPLE: Rebating a Tax 

In 1974 the Organization of Petroleum Exporting Countries (OPEC) insti¬ 
tuted an oil embargo against the United States. OPEC was able to stop oil 
shipments to U.S. ports for several weeks. The vulnerability of the United 
States to such disruptions was very disturbing to Congress and the pres¬ 
ident, and there were many plans proposed to reduce the United States’s 
dependence on foreign oil. 

One such plan involved increasing the gasoline tax. Increasing the cost 
of gasoline to the consumers would make them reduce their consumption 
of gasoline, and the reduced demand for gasoline would in turn reduce the 
demand for foreign oil. 

But a straight increase in the tax on gasoline would hit consumers where 
it hurts—in the pocketbook—and by itself such a plan would be politically 
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*1 


Substitution effect = total effect 

Perfect substitutes. Slutsky decomposition with perfect sub¬ 
stitutes. 


Figure 

8.5 


infeasible. So it was suggested that the revenues raised from consumers by 
this tax would be returned to the consumers in the form of direct money 
payments, or via the reduction of some other tax. 

Critics of this proposal argued that paying the revenue raised by the tax 
back to the consumers would have no effect on demand since they could 
just use the rebated money to purchase more gasoline. What does economic 
analysis say about this plan? 

Let us suppose, for simplicity, that the tax on gasoline would end up 
being passed along entirely to the consumers of gasoline so that the price 
of gasoline will go up by exactly the amount of the tax. (In general, only 
part of the tax would be passed along, but we will ignore that complication 
here.) Suppose that the tax would raise the price of gasoline from p to 
p' = p + t, and that the average consumer would respond by reducing 
his demand from x to x'. The average consumer is paying t dollars more 
for gasoline, and he is consuming x' gallons of gasoline after the tax is 
imposed, so the amount of revenue raised by the tax from the average 
consumer would be 

R = tx' = (p' — p)x'. 

Note that the revenue raised by the tax will depend on how much gaso¬ 
line the consumer ends up consuming, x ', not how much he was initially 
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Figure 

8.6 



Quasilinear preferences. In the case of quasilinear prefer¬ 
ences, the entire change in demand is due to the substitution 
effect. 


consuming, x. 

If we let y be the expenditure on all other goods and set its price to be 
1, then the original budget constraint is 

px + y = m, (8-4) 

and the budget constraint in the presence of the tax-rebate plan is 

(p + t)x' + y' = m + tx'. (8.5) 

In budget constraint (8.5) the average consumer is choosing the left-hand 
side variables—the consumption of each good but the right-hand side— 
his income and the rebate from the government—are taken as fixed. The 
rebate depends on what all consumers do, not what the average consumer 
does. In this case, the rebate turns out to be the taxes collected from the 
average consumer—but that’s because he is average, not because of any 
causal connection. 

If we cancel tx' from each side of equation (8.5), we have 

px' + y' = m. 

Thus (x',y') is a bundle that was affordable under the original budget 
constraint and rejected in favor of (x,y). Thus it must be that (x,y) 
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is preferred to the consumers are made worse off by this plan. 

Perhaps that is why it was never put into effect! 

The equilibrium with a rebated tax is depicted in Figure 8.7. The tax 
makes good 1 more expensive, and the rebate increases money income. 
The original bundle is no longer affordable, and the consumer is definitely 
made worse off. The consumer’s choice under the tax-rebate plan involves 
consuming less gasoline and more of “all other goods.” 



Rebating a tax. Taxing a consumer and rebating the tax 
revenues makes the consumer worse off. 


What can we say about the amount of consumption of gasoline? The 
average consumer could afford his old consumption of gasoline, but because 
of the tax, gasoline is now more expensive. In general, the consumer would 
choose to consume less of it. 


EXAMPLE: Voluntary Real Time Pricing 


Figure 

8.7 


Electricity production suffers from an extreme capacity problem: it is rel¬ 
atively cheap to produce up to capacity, at which point it is, by definition, 
impossible to produce more. Building capacity is extremely expensive, so 
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finding ways to reduce the use of electricity during periods of peak demand 
is very attractive from an economic point of view. 

In states with warm climates, such as Georgia, roughly 30 percent of 
usage during periods of peak demand is due to air conditioning. Further¬ 
more, it is relatively easy to forecast temperature one day ahead so that 
potential users will have time to adjust their demand by setting their air 
conditioning to a higher temperature, wearing light clothes, and so on. 
The challenge is to set up a pricing system so that those users who are able 
to cut back on their electricity use will have an incentive to reduce their 
consumption. 

One way to accomplish this is through the use of Real Time Pricing 
(RTP). In a Real Time Pricing program, large industrial users are equipped 
with special meters that allow the price of electricity to vary from minute to 
minute, depending on signals sent from the electricity generating company. 
As the demand for electricity approaches capacity, the generating company 
increases the price so as to encourage users to cut back on their usage. 
The price schedule is determined as a function of the total demand for 
electricity. 

Georgia Power Company claims that it runs the largest real time pric¬ 
ing program in the world. In 1999 it was able to reduce demand by 750 
megawatts on high-price days by inducing some large customers to cut their 
demand by as much as 60 percent. 

Georgia Power has devised several interesting variations on the basic real 
time pricing model. In one pricing plan, customers are assigned a baseline 
quantity, which represents their normal usage. When electricity is in short 
supply and the real time price increases, these users face a higher price for 
electricity use in excess of their baseline quantity. But they also receive a 
rebate if they can manage to cut their electricity use below their baseline 
amount. 

Figure 8.8 shows how this affects the budget line of the users. The 
vertical axis is “money to spend on things other than electricity” and the 
horizontal axis is “electricity use.” In normal times, users choose their 
electricity consumption to maximize utility subject to a budget constraint 
which is determined by the baseline price of electricity. The resulting choice 
is their baseline consumption. 

When the temperature rises, the real time price increases, making elec¬ 
tricity more expensive. But this increase in price is a good thing for users 
who can cut back their consumption, since they receive a rebate based on 
the high real time price for every kilowatt of reduced usage. If usage stays 
at the baseline amount, then the user’s bill will not change. 

It is not hard to see that this pricing plan is a Slutsky pivot around the 
baseline consumption. Thus we can be confident that electricity usage will 
decline, and that users will be at least as well off at the real time price as 
at the baseline price. Indeed, the program has been quite popular, with 
over 1,600 voluntary participants. 
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ELECTRICITY 

Voluntary real time pricing. Users pay higher rates for 
additional electricity when the real time price rises, but they 
also get rebates at the same price if they cut back their use. 

This results in a pivot around the baseline use and tends to 
make the customers better off. 


Figure 

8.8 


8.8 Another Substitution Effect 

The substitution effect is the name that economists give to the change in 
demand when prices change but a consumer’s purchasing power is held 
constant, so that the original bundle remains affordable. At least this is 
one definition of the substitution effect. There is another definition that is 
also useful. 

The definition we have studied above is called the Slutsky substitution 
effect. The definition we will describe in this section is called the Hicks 
substitution effect . 2 

Suppose that instead of pivoting the budget line around the original 
consumption bundle, we now roll the budget line around the indifference 
curve through the original consumption bundle, as depicted in Figure 8.9. 
In this way we present the consumer with a new budget line that has the 
same relative prices as the final budget line but has a different income. The 
purchasing power he has under this budget line will no longer be sufficient to 


2 


The concept is named for Sir John Hicks, an English recipient of the Nobel Prize in 
Economics. 
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purchase his original bundle of goods—but it will be sufficient to purchase 
a bundle that is just indifferent to his original bundle. 


Figure 

8.9 



effect effect 

The Hicks substitution effect. Here we pivot the budget line 
around the indifference curve rather than around the original 
choice. 


Thus the Hicks substitution effect keeps utility constant rather than keep¬ 
ing purchasing power constant. The Slutsky substitution effect gives the 
consumer just enough money to get back to his old level of consumption, 
while the Hicks substitution effect gives the consumer just enough money 
to get back to his old indifference curve. Despite this difference in defini¬ 
tion, it turns out that the Hicks substitution effect must be negative—in 
the sense that it is in a direction opposite that of the price change—just 
like the Slutsky substitution effect. 

The proof is again by revealed preference. Let (xi,X 2 ) be a demanded 
bundle at some prices (pi,P 2 ), and let ( 2/1 , 2/2 ) be a demanded bundle at 
some other prices (<ji, 92 )- Suppose that income is such that the consumer 
is indifferent between (xi, X 2 ) and (yi, y 2 ). Since the consumer is indifferent 
between (xi,X 2 ) and ( 2 / 1 , 2 / 2 ), neither bundle can be revealed preferred to 
the other. 

Using the definition of revealed preference, this means that the following 









COMPENSATED DEMAND CURVES 155 


two inequalities are not true: 

PiXi + p 2 x 2 > piyi + P2V2 

qiUi + Q2U2 > qixi + q 2 X2- 

It follows that these inequalities are true: 

P1X1 + P2X2 < Piyi + P 2 V 2 
qiyi + q2V2 < qixi + q 2 x 2 - 

Adding these inequalities together and rearranging them we have 

(Qi - Pi)( 2 /i - £1) + (q 2 -P2X2/2 - x 2 ) < 0 . 

This is a general statement about how demands change when prices 
change if income is adjusted so as to keep the consumer on the same in¬ 
difference curve. In the particular case we are concerned with, we are only 
changing the first price. Therefore (72 = P 2 , and we are left with 

(qi ~Pi)(yi - xi) < 0 . 

This equation says that the change in the quantity demanded must have 
the opposite sign from that of the price change, which is what we wanted 
to show. 

The total change in demand is still equal to the substitution effect plus 
the income effect—but now it is the Hicks substitution effect. Since the 
Hicks substitution effect is also negative, the Slutsky equation takes exactly 
the same form as we had earlier and has exactly the same interpretation. 
Both the Slutsky and Hicks definitions of the substitution effect have their 
place, and which is more useful depends on the problem at hand. It can 
be shown that for small changes in price, the two substitution effects are 
virtually identical. 


8.9 Compensated Demand Curves 

We have seen how the quantity demanded changes as a price changes in 
three different contexts: holding income fixed (the standard case), holding 
purchasing power fixed (the Slutsky substitution effect), and holding utility 
fixed (the Hicks substitution effect). We can draw the relationship between 
price and quantity demanded holding any of these three variables fixed. 
This gives rise to three different demand curves: the standard demand 
curve, the Slutsky demand curve, and the Hicks demand curve. 

The analysis of this chapter shows that the Slutsky and Hicks demand 
curves are always downward sloping curves. Furthermore the ordinary 
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demand curve is a downward sloping curve for normal goods. However, 
the Giffen analysis shows that it is theoretically possible that the ordinary 
demand curve may slope upwards for an inferior good. 

The Hicksian demand curve—the one with utility held constant—is some¬ 
times called the compensated demand curve. This terminology arises 
naturally if you think of constructing the Hicksian demand curve by ad¬ 
justing income as the price changes so as to keep the consumer’s utility 
constant. Hence the consumer is “compensated” for the price changes, and 
his utility is the same at every point on the Hicksian demand curve. This 
is in contrast to the situation with an ordinary demand curve. In this case 
the consumer is worse off facing higher prices than lower prices since his 
income is constant. 

The compensated demand curve turns out to be very useful in advanced 
courses, especially in treatments of benefit-cost analysis. In this sort of 
analysis it is natural to ask what size payments are necessary to compen¬ 
sate consumers for some policy change. The magnitude of such payments 
gives a useful estimate of the cost of the policy change. However, actual 
calculation of compensated demand curves requires more mathematical ma¬ 
chinery than we have developed in this text. 


Summary 


1. When the price of a good decreases, there will be two effects on consump¬ 
tion. The change in relative prices makes the consumer want to consume 
more of the cheaper good. The increase in purchasing power due to the 
lower price may increase or decrease consumption, depending on whether 
the good is a normal good or an inferior good. 

2. The change in demand due to the change in relative prices is called the 
substitution effect; the change due to the change in purchasing power is 
called the income effect. 

3. The substitution effect is how demand changes when prices change and 
purchasing power is held constant, in the sense that the original bundle 
remains affordable. To hold real purchasing power constant, money income 
will have to change. The necessary change in money income is given by 
Am = aqApi. 

4. The Slutsky equation says that the total change in demand is the sum 
of the substitution effect and the income effect. 

5. The Law of Demand says that normal goods must have downward- 
sloping demand curves. 
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REVIEW QUESTIONS 


1. Suppose a consumer has preferences between two goods that are perfect 
substitutes. Can you change prices in such a way that the entire demand 
response is due to the income effect? 

2. Suppose that preferences are concave. Is it still the case that the substi¬ 
tution effect is negative? 

3. In the case of the gasoline tax, what would happen if the rebate to the 
consumers were based on their original consumption of gasoline, x , rather 
than on their final consumption of gasoline, x'l 

4. In the case described in the preceding question, would the government 
be paying out more or less than it received in tax revenues? 

5. In this case would the consumers be better off or worse off if the tax 
with rebate based on original consumption were in effect? 

APPENDIX 

Let us derive the Slutsky equation using calculus. Consider the Slutsky defini¬ 
tion of the substitution effect, in which the income is adjusted so as to give the 
consumer just enough to buy the original consumption bundle, which we will now 
denote by (xi, 2 : 2 )■ If the prices are (pi,p 2 ), then the consumer’s actual choice 
with this adjustment will depend on (pi,p2) and (xi,X2)- Let’s call this relation¬ 
ship the Slutsky demand function for good 1, and write it as x{(pi,p 2 ,xi,X 2 )- 
Suppose the original demanded bundle is ( 11 , 2 : 2 ) at prices ( p 1 ,p 2 ) and income 
to. The Slutsky demand function tells us what the consumer would demand 
facing some different prices (pi,P2) and having income pixi + P2X2- Thus the 
Slutsky demand function at (pi,P2, xi, X2) is the ordinary demand at {pi,P2) and 
income pixi + P2X2. That is, 

X S 1 (pi,P2,Xl,X2) = Xl(pi,P2,PlXl +P 2 X 2 ). 

This equation says that the Slutsky demand at prices (pi,p2) is that amount 
which the consumer would demand if he had enough income to purchase his 
original bundle of goods (xi, 2 : 2 ). This is just the definition of the Slutsky demand 
function. 

Differentiating this identity with respect to pi , we have 

dx s 1 (pi,p 2 ,xi,x 2 ) _ dxi(pi,p 2 ,m) , dxi(pi,p 2 ,fn) _ 

r\ r\ I r\ X \ . 

Op 1 op 1 om 

Rearranging we have 

dx 1 (p 1 ,p 2 ,m) _ dx\{p 1 ,p 2 ,x\,X 2 ) _ dxi(pi,p 2 ,fn) _ 
dpi dpi dm Xl ' 
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Note the use of the chain rule in this calculation. 

This is a derivative form of the Slutsky equation. It says that the total effect 
of a price change is composed of a substitution effect (where income is adjusted 
to keep the bundle (xi,X 2 ) feasible) and an income effect. We know from the 
text that the substitution effect is negative and that the sign of the income effect 
depends on whether the good in question is inferior or not. As you can see, this 
is just the form of the Slutsky equation considered in the text, except that we 
have replaced the A’s with derivative signs. 

What about the Hicks substitution effect? It is also possible to define a Slutsky 
equation for it. We let Xi(pi,p 2 ,u) be the Hicksian demand function, which 
measures how much the consumer demands of good 1 at prices {pi,P 2 ) if income 
is adjusted to keep the level of utility constant at the original level u. It turns 
out that in this case the Slutsky equation takes the form 

dxi(pi,p 2 , m) _ dxi(pi,p 2 ,u) _ dxi(pi,p 2 ,m) _ 
dpi dpi dm 

The proof of this equation hinges on the fact that 

dxi(pi,p 2 ,u) _ dx 3 i(pi,P 2 ,Xl,X 2 ) 
dpi dpi 

for infinitesimal changes in price. That is, for derivative size changes in price, the 
Slutsky substitution and the Hicks substitution effect are the same. The proof 
of this is not terribly difficult, but it involves some concepts that are beyond 
the scope of this book. A relatively simple proof is given in Hal R. Varian, 
Microeconomic Analysis, 3rd ed. (New York: Norton, 1992). 


EXAMPLE: Rebating a Small Tax 


We can use the calculus version of the Slutsky equation to see how consumption 
choices would react to a small change in a tax when the tax revenues are rebated 
to the consumers. 

Assume, as before, that the tax causes the price to rise by the full amount of 
the tax. Let x be the amount of gasoline, p its original price, and t the amount 
of the tax. Then the change in consumption will be given by 


dx 


dx . 
dp 


t + 


dx 

dm 


tx. 


The first term measures how demand responds to the price change times the 
amount of the price change—which gives us the price effect of the tax. The 
second terms tells us how demand responds to a change in income times the 
amount that income has changed—income has gone up by the amount of the tax 
revenues rebated to the consumer. 

Now use Slutsky’s equation to expand the first term on the right-hand side to 
get the substitution and income effects of the price change itself: 


dx 


dx 3 dx dx dx 3 

~ ^~ tx + ~^~ tx = 

dp dm dm dp 
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The income effect cancels out, and all that is left is the pure substitution effect. 
Imposing a small tax and rebating the revenues of the tax is just like impos¬ 
ing a price change and adjusting income so that the old consumption bundle is 
feasible—as long as the tax is small enough so that the derivative approximation 
is valid. 


CHAPTER 



BUYING AND 
SELLING 


In the simple model of the consumer that we considered in the preceding 
chapters, the income of the consumer was given. In reality people earn their 
income by selling things that they own: items that they have produced, 
assets that they have accumulated, or, most commonly, their own labor. 
In this chapter we will examine how the earlier model must be modified so 
as to describe this kind of behavior. 


9.1 Net and Gross Demands 

As before, we will limit ourselves to the two-good model. We now sup¬ 
pose that the consumer starts off with an endowment of the two goods, 
which we will denote by (wi,^)- 1 This is how much of the two goods the 
consumer has before he enters the market. Think of a farmer who goes 
to market with u>\ units of carrots and u >2 units of potatoes. The farmer 
inspects the prices available at the market and decides how much he wants 
to buy and sell of the two goods. 


1 


The Greek letter cj, omega, is pronounced “o—may-gah.' 
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Let us make a distinction here between the consumer’s gross demands 
and his net demands. The gross demand for a good is the amount of the 
good that the consumer actually ends up consuming: how much of each of 
the goods he or she takes home from the market. The net demand for a 
good is the difference between what the consumer ends up with (the gross 
demand) and the initial endowment of goods. The net demand for a good 
is simply the amount that is bought or sold of the good. 

If we let (xi,x 2 ) be the gross demands, then (aq — oq,x 2 — oj 2 ) are the 
net demands. Note that while the gross demands are typically positive 
numbers, the net demands may be positive or negative. If the net demand 
for good 1 is negative, it means that the consumer wants to consume less 
of good 1 than she has; that is, she wants to supply good 1 to the market. 
A negative net demand is simply an amount supplied. 

For purposes of economic analysis, the gross demands are the more im¬ 
portant, since that is what the consumer is ultimately concerned with. But 
the net demands are what are actually exhibited in the market and thus 
are closer to what the layman means by demand or supply. 


9.2 The Budget Constraint 

The first thing we should do is to consider the form of the budget constraint. 
What constrains the consumer’s final consumption? It must be that the 
value of the bundle of goods that she goes home with must be equal to the 
value of the bundle of goods that she came with. Or, algebraically: 


P 1 X 1 +P 2 X 2 = Pi^i + p 2 u 2 . 


We could just as well express this budget line in terms of net demands as 


pffxi - uq) +p 2 {x 2 - u) 2 ) = 0. 


If (aq — uq) is positive we say that the consumer is a net buyer or net 
demander of good 1; if it is negative we say that she is a net seller or 
net supplier. Then the above equation says that the value of what the 
consumer buys must equal the value of what she sells, which seems sensible 
enough. 

We could also express the budget line when the endowment is present 
in a form similar to the way we described it before. Now it takes two 
equations: 

PiX\ + p 2 x 2 = m 

m = piui + p 2 uj 2 . 

Once the prices are fixed, the value of the endowment, and hence the 
consumer’s money income, is fixed. 


162 BUYING AND SELLING (Ch. 9) 


What does the budget line look like graphically? When we fix the prices, 
money income is fixed, and we have a budget equation just like we had 
before. Thus the slope must be given by —P 1 /P 2 , just as before, so the only 
problem is to determine the location of the line. 

The location of the line can be determined by the following simple obser¬ 
vation: the endowment bundle is always on the budget line. That is, one 
value of (aq, aq) that satisfies the budget line is aq = uq and aq = kq. The 
endowment is always just affordable, since the amount you have to spend 
is precisely the value of the endowment. 

Putting these facts together shows that the budget line has a slope of 
~Pi/p 2 and passes through the endowment point. This is depicted in Fig¬ 
ure 9.1. 


Figure 

9.1 



The budget line. The budget line passes through the endow¬ 
ment and has a slope of —pi/p 2 - 


Given this budget constraint, the consumer can choose the optimal con¬ 
sumption bundle just as before. In Figure 9.1 we have shown an example 
of an optimal consumption bundle Just as before, it will satisfy 

the optimality condition that the marginal rate of substitution is equal to 
the price ratio. 
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In this particular case, x\ > u>\ and x 2 < u> 2 , so the consumer is a net 
buyer of good 1 and a net seller of good 2. The net demands are simply the 
net amounts that the consumer buys or sells of the two goods. In general 
the consumer may decide to be either a buyer or a seller depending on the 
relative prices of the two goods. 


9.3 Changing the Endowment 

In our previous analysis of choice we examined how the optimal consump¬ 
tion changed as the money income changed while the prices remained fixed. 
We can do a similar analysis here by asking how the optimal consumption 
changes as the endowment changes while the prices remain fixed. 

For example, suppose that the endowment changes from (u)\,u> 2 ) to some 
other value (wj,u4) such that 


P1UI+P2U2 > PiU}[ + P 2 ^ 2 - 

This inequality means that the new endowment (wj,u4) is worth less than 
the old endowment—the money income that the consumer could achieve 
by selling her endowment is less. 

This is depicted graphically in Figure 9.2A: the budget line shifts in¬ 
ward. Since this is exactly the same as a reduction in money income, we 
can conclude the same two things that we concluded in our examination of 
that case. First, the consumer is definitely worse off with the endowment 
(wj,a4) than she was with the old endowment, since her consumption pos¬ 
sibilities have been reduced. Second, her demand for each good will change 
according to whether that good is a normal good or an inferior good. 

For example, if good 1 is a normal good and the consumer’s endowment 
changes in a way that reduces its value, we can conclude that the consumer’s 
demand for good 1 will decrease. 

The case where the value of the endowment increases is depicted in Fig¬ 
ure 9.2B. Following the above argument we conclude that if the budget 
line shifts outward in a parallel way, the consumer must be made better 
off. Algebraically, if the endowment changes from (u>i,u> 2 ) to , uj' 2 ) and 
Pi<jJ\ +P 2 U 2 < Pico'i +P 2 ^ 2 , then the consumer’s new budget set must con¬ 
tain her old budget set. This in turn implies that the optimal choice of the 
consumer with the new budget set must be preferred to the optimal choice 
given the old endowment. 

It is worthwhile pondering this point a moment. In Chapter 7 we argued 
that just because a consumption bundle had a higher cost than another 
didn’t mean that it would be preferred to the other bundle. But that 
only holds for a bundle that must be consumed. If a consumer can sell a 
bundle of goods on a free market at constant prices, then she will always 
prefer a higher-valued bundle to a lower-valued bundle, simply because a 


164 BUYING AND SELLING (Ch. 9) 


Figure 

9.2 
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higher-valued bundle gives her more income, and thus more consumption 
possibilities. Therefore, an endowment that has a higher value will always 
be preferred to an endowment with a lower value. This simple observation 
will turn out to have some important implications later on. 

There’s one more case to consider: what happens if piuji+p 2^2 = Piu[ + 
PnJjJrP- Then the budget set doesn’t change at all: the consumer is just 
as well-off with (u>i,u) 2 ) as with {lo\ , a4), and her optimal choice should 
be exactly the same. The endowment has just shifted along the original 
budget line. 


9.4 Price Changes 

Earlier, when we examined how demand changed when price changed, we 
conducted our investigation under the hypothesis that money income re¬ 
mained constant. Now, when money income is determined by the value 
of the endowment, such a hypothesis is unreasonable: if the value of a 
good you are selling changes, your money income will certainly change. 
Thus in the case where the consumer has an endowment, changing prices 
automatically implies changing income. 

Let us first think about this geometrically. If the price of good 1 de¬ 
creases, we know that the budget line becomes flatter. Since the endow¬ 
ment bundle is always affordable, this means that the budget line must 
pivot around the endowment, as depicted in Figure 9.3. 
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Decreasing the price of good 1 . Lowering the price of good 
1 makes the budget line pivot around the endowment. If the 
consumer remains a supplier she must be worse off. 


In this case, the consumer is initially a seller of good 1 and remains a 
seller of good 1 even after the price has declined. What can we say about 
this consumer’s welfare? In the case depicted, the consumer is on a lower 
indifference curve after the price change than before, but will this be true 
in general? The answer comes from applying the principle of revealed 
preference. 

If the consumer remains a supplier, then her new consumption bundle 
must be on the colored part of the new budget line. But this part of the new 
budget line is inside the original budget set: all of these choices were open to 
the consumer before the price changed. Therefore, by revealed preference, 
all of these choices are worse than the original consumption bundle. We can 
therefore conclude that if the price of a good that a consumer is selling goes 
down, and the consumer decides to remain a seller, then the consumer’s 
welfare must have declined. 

What if the price of a good that the consumer is selling decreases and 
the consumer decides to switch to being a buyer of that good? In this case, 
the consumer may be better off or she may be worse off -there is no way 
to tell. 

Let us now turn to the situation where the consumer is a net buyer of a 
good. In this case everything neatly turns around: if the consumer is a net 


Figure 

9.3 
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Figure 

9.4 


buyer of a good, its price increases , and the consumer optimally decides 
to remain a buyer, then she must definitely be worse off. But if the price 
increase leads her to become a seller, it could go either way—she may be 
better off, or she may be worse off. These observations follow from a simple 
application of revealed preference just like the cases described above, but it 
is good practice for you to draw a graph just to make sure you understand 
how this works. 

Revealed preference also allows us to make some interesting points about 
the decision of whether to remain a buyer or to become a seller when prices 
change. Suppose, as in Figure 9.4, that the consumer is a net buyer of good 
1, and consider what happens if the price of good 1 decreases. Then the 
budget line becomes flatter as in Figure 9.4. 



Decreasing the price of good 1 . If a person is a buyer and 
the price of what she is buying decreases, she remains a buyer. 


As usual we don’t know for certain whether the consumer will buy more 
or less of good 1—it depends on her tastes. However, we can say something 
for sure: the consumer will continue to be a net buyer of good 1—she will 
not switch to being a seller. 

How do we know this? Well, consider what would happen if the consumer 
did switch. Then she would be consuming somewhere on the colored part 
of the new budget line in Figure 9.4. But those consumption bundles were 
feasible for her when she faced the original budget line, and she rejected 
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them in favor of So (xi,x|) must be better than any of those 

points. And under the new budget line, (x^x?) is a feasible consumption 
bundle. So whatever she consumes under the new budget line, it must be 
better than (x*,:^)—and thus better than any points on the colored part 
of the new budget line. This implies that her consumption of Xi must 
be to the right of her endowment point—that is, she must remain a net 
demander of good 1. 

Again, this kind of observation applies equally well to a person who is 
a net seller of a good: if the price of what she is selling goes up, she will 
not switch to being a net buyer. We can’t tell for sure if the consumer will 
consume more or less of the good she is selling— but we know that she will 
keep selling it if the price goes up. 


9.5 Offer Curves and Demand Curves 


Recall from Chapter 6 that price offer curves depict those combinations of 
both goods that may be demanded by a consumer and that demand curves 
depict the relationship between the price and the quantity demanded of 
some good. Exactly the same constructions work when the consumer has 
an endowment of both goods. 

Consider, for example, Figure 9.5, which illustrates the price offer curve 
and the demand curve for a consumer. The offer curve will always pass 
through the endowment, because at some price the endowment will be 
a demanded bundle; that is, at some prices the consumer will optimally 
choose not to trade. 

As we’ve seen, the consumer may decide to be a buyer of good 1 for 
some prices and a seller of good 1 for other prices. Thus the offer curve 
will generally pass to the left and to the right of the endowment point. 

The demand curve illustrated in Figure 9.5B is the gross demand curve— 
it measures the total amount the consumer chooses to consume of good 1. 
We have illustrated the net demand curve in Figure 9.6. 

Note that the net demand for good 1 will typically be negative for some 
prices. This will be when the price of good 1 becomes so high that the 
consumer chooses to become a seller of good 1. At some price the consumer 
switches between being a net demander to being a net supplier of good 1. 

It is conventional to plot the supply curve in the positive orthant, al¬ 
though it actually makes more sense to think of supply as just a negative 
demand. We’ll bow to tradition here and plot the net supply curve in the 
normal way—as a positive amount, as in Figure 9.6. 

Algebraically the net demand for good 1, di(pi,p 2 ), is the difference 
between the gross demand Xi(pi,p 2 ) and the endowment of good 1, when 
this difference is positive; that is, when the consumer wants more of the 
good than he or she has: 


di{pi,P2) 


f xi(pi,p 2 ) — ui i if this is positive; 
10 otherwise. 
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Figure 

9.5 
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The offer curve and the demand curve. These are two 
ways of depicting the relationship between the demanded bundle 
and the prices when an endowment is present. 


The net supply curve is the difference between how much the consumer 
has of good 1 and how much he or she wants when this difference is positive: 


Sl{Pl,P 2 ) 


( u>i — xi(pi,P 2 ) if this is positive; 
10 otherwise. 


Everything that we’ve established about the properties of demand behav¬ 
ior applies directly to the supply behavior of a consumer—because supply 
is just negative demand. If the gross demand curve is always downward 
sloping, then the net demand curve will be downward sloping and the sup¬ 
ply curve will be upward sloping. Think about it: if an increase in the 
price makes the net demand more negative, then the net supply will be 
more positive. 


9.6 The Slutsky Equation Revisited 

The above applications of revealed preference are handy, but they don’t 
really answer the main question: how does the demand for a good react to 
a change in its price? We saw in Chapter 8 that if money income was held 
constant, and the good was a normal good, then a reduction in its price 
must lead to an increase in demand. 

The catch is the phrase “money income was held constant.” The case we 
are examining here necessarily involves a change in money income, since 
the value of the endowment will necessarily change when a price changes. 
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Figure 

9.6 


In Chapter 8 we described the Slutsky equation that decomposed the 
change in demand due to a price change into a substitution effect and an 
income effect. The income effect was due to the change in purchasing power 
when prices change. But now, purchasing power has two reasons to change 
when a price changes. The first is the one involved in the definition of the 
Slutsky equation: when a price falls, for example, you can buy just as much 
of a good as you were consuming before and have some extra money left 
over. Let us refer to this as the ordinary income effect. But the second 
effect is new. When the price of a good changes, it changes the value of 
your endowment and thus changes your money income. For example, if 
you are a net supplier of a good, then a fall in its price will reduce your 
money income directly since you won’t be able to sell your endowment for 
as much money as you could before. We will have the same effects that 
we had before, plus an extra income effect from the influence of the prices 
on the value of the endowment bundle. We’ll call this the endowment 
income effect. 

In the earlier form of the Slutsky equation, the amount of money income 
you had was fixed. Now we have to worry about how your money income 
changes as the value of your endowment changes. Thus, when we calculate 
the effect of a change in price on demand, the Slutsky equation will take 
the form: 

total change in demand = change due to substitution effect + change in de¬ 
mand due to ordinary income effect + change in demand due to endowment 
income effect. 
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Gross demand, net demand, and net supply. Using the 
gross demand and net demand to depict the demand and supply 
behavior. 












170 BUYING AND SELLING (Ch. 9) 


The first two effects are familiar. As before, let us use Axi to stand for 
the total change in demand, Axf to stand for the change in demand due 
to the substitution effect, and Ax™ to stand for the change in demand due 
to the ordinary income effect. Then we can substitute these terms into the 
above “verbal equation” to get the Slutsky equation in terms of rates of 
change: 


—— = ——- — x± ——— + endowment income effect. (9.1) 
A pi A pi Am v ' 

What will the last term look like? We’ll derive an explicit expression 
below, but let us first think about what is involved. When the price of the 
endowment changes, money income will change, and this change in money 
income will induce a change in demand. Thus the endowment income effect 
will consist of two terms: 

endowment income effect = change in demand when income changes 
x the change in income when price changes. (9-2) 

Let’s look at the second effect first. Since income is defined to be 


m = piivi + P2U2, 


we have 

Am 

A^i =UJl ' 

This tells us how money income changes when the price of good 1 changes: 
if you have 10 units of good 1 to sell, and its price goes up by $1, your 
money income will go up by $10. 

The first term in equation (9.2) is just how demand changes when income 
changes. We already have an expression for this: it is Ax™/ Am: the change 
in demand divided by the change in income. Thus the endowment income 
effect is given by 

/\ r rV^' /\ TYl. /\ rp TTl 

endowment income effect = 1 —— = . 1 uj\. (9.3) 

Am Api Am y ’ 

Inserting equation (9.3) into equation (9.1) we get the final form of the 
Slutsky equation: 


Axi 

Ap\ 


Ax\ 

Ap\ 


+ (u>i — aq) 


Ax™ 

Am 


This equation can be used to answer the question posed above. We know 
that the sign of the substitution effect is always negative—opposite the 
direction of the change in price. Let us suppose that the good is a normal 
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good, so that Ax™/Am > 0. Then the sign of the combined income effect 
depends on whether the person is a net demander or a net supplier of 
the good in question. If the person is a net demander of a normal good, 
and its price increases, then the consumer will necessarily buy less of it. 
If the consumer is a net supplier of a normal good, then the sign of the 
total effect is ambiguous: it depends on the magnitude of the (positive) 
combined income effect as compared to the magnitude of the (negative) 
substitution effect. 

As before, each of these changes can be depicted graphically, although 
the graph gets rather messy. Refer to Figure 9.7, which depicts the Slutsky 
decomposition of a price change. The total change in the demand for good 1 
is indicated by the movement from A to C. This is the sum of three separate 
movements: the substitution effect, which is the movement from A to B , 
and two income effects. The ordinary income effect, which is the movement 
from B to D , is the change in demand holding money income fixed —that 
is, the same income effect that we examined in Chapter 8. But since the 
value of the endowment changes when prices change, there is now an extra 
income effect: because of the change in the value of the endowment, money 
income changes. This change in money income shifts the budget line back 
inward so that it passes through the endowment bundle. The change in 
demand from D to C measures this endowment income effect. 



The Slutsky equation revisited. Breaking up the effect 
of the price change into the substitution effect (A to B ), the 
ordinary income effect (B to D ), and the endowment income 
effect (D to C). 


Figure 

9.7 
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9.7 Use of the Slutsky Equation 

Suppose that we have a consumer who sells apples and oranges that he 
grows on a few trees in his backyard, like the consumer we described at the 
beginning of Chapter 8. We said there that if the price of apples increased, 
then this consumer might actually consume more apples. Using the Slutsky 
equation derived in this chapter, it is not hard to see why. If we let x a stand 
for the consumer’s demand for apples, and let p a be the price of apples, 
then we know that 


A Xa 

Ap a 


Ap a + ( w « x °) Am ' 

(-) (+) (+) 


This says that the total change in the demand for apples when the price 
of apples changes is the substitution effect plus the income effect. The sub¬ 
stitution effect works in the right direction—increasing the price decreases 
the demand for apples. But if apples are a normal good for this consumer, 
the income effect works in the wrong direction. Since the consumer is a net 
supplier of apples, the increase in the price of apples increases his money 
income so much that he wants to consume more apples due to the income 
effect. If the latter term is strong enough to outweigh the substitution 
effect, we can easily get the “perverse” result. 


EXAMPLE: Calculating the Endowment Income Effect 


Let’s try a little numerical example. Suppose that a dairy farmer produces 
40 quarts of milk a week. Initially the price of milk is $3 a quart. His 
demand function for milk, for his own consumption, is 


X\ = 10 + 


TO 

10 pi' 


Since he is producing 40 quarts at $3 a quart, his income is $120 a week. 
His initial demand for milk is therefore X\ = 14. Now suppose that the 
price of milk changes to $2 a quart. His money income will then change to 
m! = 2 x 40 = $80, and his demand will be x\ = 10 + 80/20 = 14. 

If his money income had remained fixed at to = $120, he would have 
purchased X\ = 10 + 120/10 x 2 = 16 quarts of milk at this price. Thus the 
endowment income effect—the change in his demand due to the change 
in the value of his endowment—is —2. The substitution effect and the 
ordinary income effect for this problem were calculated in Chapter 8. 
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9.8 Labor Supply 

Let us apply the idea of an endowment to analyzing a consumer’s labor 
supply decision. The consumer can choose to work a lot and have rela¬ 
tively high consumption, or can choose to work a little and have a small 
consumption. The amount of consumption and labor will be determined 
by the interaction of the consumer’s preferences and the budget constraint. 


The Budget Constraint 

Let us suppose that the consumer initially has some money income M that 
she receives whether she works or not. This might be income from invest¬ 
ments or from relatives, for example. We call this amount the consumer’s 
nonlabor income. (The consumer could have zero nonlabor income, but 
we want to allow for the possibility that it is positive.) 

Let us use C to indicate the amount of consumption the consumer has, 
and use p to denote the price of consumption. Then letting w be the wage 
rate, and L the amount of labor supplied, we have the budget constraint: 

pC = M + wL. 

This says that the value of what the consumer consumes must be equal to 
her nonlabor income plus her labor income. 

Let us try to compare the above formulation to the previous examples 
of budget constraints. The major difference is that we have something 
that the consumer is choosing—labor supply—on the right-hand side of 
the equation. We can easily transpose it to the left-hand side to get 


pC — wL = M. 


This is better, but we have a minus sign where we normally have a 
plus sign. How can we remedy this? Let us suppose that there is some 
maximum amount of labor supply possible—24 hours a day, 7 days a week, 
or whatever is compatible with the units of measurement we are using. Let 
L denote this amount of labor time. Then adding wL to each side and 
rearranging we have 


pC + w(L — L) = M + wL. 

Let us define C = M/p , the amount of consumption that the consumer 
would have if she didn’t work at all. That is, C is her endowment of 
consumption, so we write 

pC + w(L — L) = pC + wL. 
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Now we have an equation very much like those we’ve seen before. We 
have two choice variables on the left-hand side and two endowment variables 
on the right-hand side. The variable L—L can be interpreted as the amount 
of “leisure”—that is, time that isn’t labor time. Let us use the variable 
R (for relaxation!) to denote leisure, so that R = L — L. Then the total 
amount of time you have available for leisure is R = L and the budget 
constraint becomes 

pC + wR = pC + wR. 

The above equation is formally identical to the very first budget con¬ 
straint that we wrote in this chapter. However, it has a much more inter¬ 
esting interpretation. It says that the value of a consumer’s consumption 
plus her leisure has to equal the value of her endowment of consumption 
and her endowment of time, where her endowment of time is valued at her 
wage rate. The wage rate is not only the price of labor, it is also the price 
of leisure. 

After all, if your wage rate is $10 an hour and you decide to consume 
an extra hour’s leisure, how much does it cost you? The answer is that 
it costs you $10 in forgone income—that’s the price of that extra hour’s 
consumption of leisure. Economists sometimes say that the wage rate is 

the opportunity cost of leisure. 

The right-hand side of this budget constraint is sometimes called the 
consumer’s full income or implicit income. It measures the value of 
what the consumer owns—her endowment of consumption goods, if any, 
and her endowment of her own time. This is to be distinguished from the 
consumer’s measured income, which is simply the income she receives 
from selling off some of her time. 

The nice thing about this budget constraint is that it is just like the ones 
we’ve seen before. It passes through the endowment point (L, C) and has a 
slope of —w/p. The endowment would be what the consumer would get if 
she did not engage in market trade at all, and the slope of the budget line 
tells us the rate at which the market will exchange one good for another. 

The optimal choice occurs where the marginal rate of substitution—the 
tradeoff between consumption and leisure—equals w/p , the real wage, as 
depicted in Figure 9.8. The value of the extra consumption to the consumer 
from working a little more has to be just equal to the value of the lost leisure 
that it takes to generate that consumption. The real wage is the amount 
of consumption that the consumer can purchase if she gives up an hour of 
leisure. 


9.9 Comparative Statics of Labor Supply 

First let us consider how a consumer’s labor supply changes as money 
income changes with the price and wage held fixed. If you won the state 
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Leisure Labor 


Labor supply. The optimal choice describes the demand for 
leisure measured from the origin to the right, and the supply of 
labor measured from the endowment to the left. 


lottery and got a big increase in nonlabor income, what would happen to 
your supply of labor? What would happen to your demand for leisure? 

For most people, the supply of labor would drop when their money in¬ 
come increased. In other words, leisure is probably a normal good for most 
people: when their money income rises, people choose to consume more 
leisure. There seems to be a fair amount of evidence for this observation, 
so we will adopt it as a maintained hypothesis: we will assume that leisure 
is a normal good. 

What does this imply about the response of the consumer’s labor supply 
to changes in the wage rate? When the wage rate increases there are two 
effects: the return to working more increase and the cost of consuming 
leisure increases. By using the ideas of income and substitution effects and 
the Slutsky equation we can isolate these individual effects and analyze 
them. 

When the wage rate increases, leisure becomes more expensive, which by 
itself leads people to want less of it (the substitution effect). Since leisure 
is a normal good, we would then predict that an increase in the wage rate 
would necessarily lead to a decrease in the demand for leisure—that is, an 
increase in the supply of labor. This follows from the Slutsky equation 
given in Chapter 8. A normal good must have a negatively sloped demand 
curve. If leisure is a normal good, then the supply curve of labor must be 
positively sloped. 


Figure 

9.8 
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But there is a problem with this analysis. First, at an intuitive level, it 
does not seem reasonable that increasing the wage would always result in 
an increased supply of labor. If my wage becomes very high, I might well 
“spend” the extra income in consuming leisure. How can we reconcile this 
apparently plausible behavior with the economic theory given above? 

If the theory gives the wrong answer, it is probably because we’ve mis¬ 
applied the theory. And indeed in this case we have. The Slutsky example 
described earlier gave the change in demand holding money income con¬ 
stant. But if the wage rate changes, then money income must change as 
well. The change in demand resulting from a change in money income is 
an extra income effect—the endowment income effect. It occurs on top of 
the ordinary income effect. 

If we apply the appropriate version of the Slutsky equation given earlier 
in this chapter, we get the following expression: 


= substitution effect + (R — R) ■ 

(-) (+) (+) 


(9.4) 


In this expression the substitution effect is definitely negative, as it al¬ 
ways is, and AR/Am is positive since we are assuming that leisure is a 
normal good. But (R — R) is positive as well, so the sign of the whole 
expression is ambiguous. Unlike the usual case of consumer demand, the 
demand for leisure will have an ambiguous sign, even if leisure is a normal 
good. As the wage rate increases, people may work more or less. 

Why does this ambiguity arise? When the wage rate increases, the substi¬ 
tution effect says work more in order to substitute consumption for leisure. 
But when the wage rate increases, the value of the endowment goes up as 
well. This is just like extra income, which may very well be consumed in 
taking extra leisure. Which is the larger effect is an empirical matter and 
cannot be decided by theory alone. We have to look at people’s actual 
labor supply decisions to determine which effect dominates. 

The case where an increase in the wage rate results in a decrease in the 
supply of labor is represented by a backward-bending labor supply 
curve. The Slutsky equation tells us that this effect is more likely to occur 
the larger is (R — R), that is, the larger is the supply of labor. When 
R = R : the consumer is consuming only leisure, so an increase in the wage 
will result in a pure substitution effect and thus an increase in the supply 
of labor. But as the labor supply increases, each increase in the wage gives 
the consumer additional income for all the hours he is working, so that 
after some point he may well decide to use this extra income to “purchase” 
additional leisure—that is, to reduce his supply of labor. 

A backward-bending labor supply curve is depicted in Figure 9.9. When 
the wage rate is small, the substitution effect is larger than the income 
effect, and an increase in the wage will decrease the demand for leisure and 
hence increase the supply of labor. But for larger wage rates the income 
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effect may outweigh the substitution effect, and an increase in the wage 
will reduce the supply of labor. 




B Labor supply curve 


Backward-bending labor supply. As the wage rate in¬ 
creases, the supply of labor increases from L\ to L 2 . But a 
further increase in the wage rate reduces the supply of labor 
back to L\. 


Figure 

9.9 


EXAMPLE: Overtime and the Supply of Labor 

Consider a worker who has chosen to supply a certain amount of labor 
L* = R — R* when faced with the wage rate w as depicted in Figure 9.10. 
Now suppose that the firm offers him a higher wage, w' > w, for extra time 
that he chooses to work. Such a payment is known as an overtime wage. 

In terms of Figure 9.10, this means that the slope of the budget line will 
be steeper for labor supplied in excess of L*. But then we know that the 
worker will optimally choose to supply more labor, by the usual sort of 
revealed preference argument: the choices involving working less than L* 
were available before the overtime was offered and were rejected. 

Note that we get an unambiguous increase in labor supply with an over¬ 
time wage, whereas just offering a higher wage for all hours worked has an 
ambiguous effect—as discussed above, labor supply may increase or it may 
decrease. The reason is that the response to an overtime wage is essentially 
a pure substitution effect—the change in the optimal choice resulting from 

















178 BUYING AND SELLING (Ch. 9) 


Figure 

9.10 



Overtime versus an ordinary wage increase. An increase 
in the overtime wage definitely increases the supply of labor, 
while an increase in the straight wage could decrease the supply 
of labor. 


pivoting the budget line around the chosen point. Overtime gives a higher 
payment for the extra hours worked, whereas a straight increase in the wage 
gives a higher payment for all hours worked. Thus a straight-wage increase 
involves both a substitution and an income effect while an overtime-wage 
increase results in a pure substitution effect. An example of this is shown in 
Figure 9.10. There an increase in the straight wage results in a decrease in 
labor supply, while an increase in the overtime wage results in an increase 
in labor supply. 


Summary 


1. Consumers earn income by selling their endowment of goods. 

2. The gross demand for a good is the amount that the consumer ends up 
consuming. The net demand for a good is the amount the consumer buys. 
Thus the net demand is the difference between the gross demand and the 
endowment. 
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3. The budget constraint has a slope of —p\/p 2 and passes through the 
endowment bundle. 

4. When a price changes, the value of what the consumer has to sell will 
change and thereby generate an additional income effect in the Slutsky 
equation. 

5. Labor supply is an interesting example of the interaction of income and 
substitution effects. Due to the interaction of these two effects, the response 
of labor supply to a change in the wage rate is ambiguous. 


REVIEW QUESTIONS 


1. If a consumer’s net demands are (5,-3) and her endowment is (4,4), 
what are her gross demands? 

2. The prices are (p\,P 2 ) = (2,3), and the consumer is currently consuming 
(xi, X 2 ) = (4,4). There is a perfect market for the two goods in which they 
can be bought and sold costlessly. Will the consumer necessarily prefer 
consuming the bundle (j/i, 7 / 2 ) = (3,5)? Will she necessarily prefer having 
the bundle ( 2 / 1 , 2 / 2 )? 

3. The prices are (p\,P 2 ) — (2,3), and the consumer is currently consuming 
(xi,X 2 ) = (4,4). Now the prices change to ( 91 , 52 ) = (2,4). Could the 
consumer be better off under these new prices? 

4. The U.S. currently imports about half of the petroleum that it uses. The 
rest of its needs are met by domestic production. Could the price of oil rise 
so much that the U.S. would be made better off? 

5. Suppose that by some miracle the number of hours in the day increased 
from 24 to 30 hours (with luck this would happen shortly before exam 
week). How would this affect the budget constraint? 

6. If leisure is an inferior good, what can you say about the slope of the 
labor supply curve? 


APPENDIX 

The derivation of the Slutsky equation in the text contained one bit of hand 
waving. When we considered how changing the monetary value of the endowment 
affects demand, we said that it was equal to Ax™/Am. In our old version of the 
Slutsky equation this was the rate of change in demand when income changed 
so as to keep the original consumption bundle affordable. But that will not 
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necessarily be equal to the rate of change of demand when the value of the 
endowment changes. Let’s examine this point in a little more detail. 

Let the price of good 1 change from pi to p \, and use m" to denote the new 
money income at the price p'i due to the change in the value of the endowment. 
Suppose that the price of good 2 remains fixed so we can omit it as an argument 
of the demand function. 

By definition of m", we know that 

m" — m = Apiu>i. 

Note that it is identically true that 

xi - xi(pi,m) _ 

Api 

+ x 1 (j/ 1 ,m')-x 1 (pi,m) (substitutioneffect) 

Api 

-—- (ordinary income etiectj 

X! (p'i, m") — x\ (p'i, m) 

H--- (endowment income effect). 

Api 

(Just cancel out identical terms with opposite signs on the right-hand side.) 

By definition of the ordinary income effect, 

. m! — m 
Api = - 

Xl 

and by definition of the endowment income effect, 

. m" — m 
Api = -. 

OJl 

Making these replacements gives us a Slutsky equation of the form 

- xi(pi,m) _ 

Api. 

xi(p\ ,m') — xi(pi,m) , , „ , 

-|--—-----—-- (substitution effect) 

Api 

xi(Pi,m') -xi(p[,m) . 

- xi (ordinary income effect) 

m! — m 

x\{pi , m") — xi (pi, to) . . 

-|--- - - -u>i (endowment income effect). 

m" — m 

Writing this in terms of As, we have 

Axi Axi Ax? Ax'? 

~7 - = —7 -7-Xl “I-7- U>1. 

Api Api Am Am 
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The only new term here is the last one. It tells how the demand for good 1 
changes as income changes, times the endowment of good 1. This is precisely the 
endowment income effect. 

Suppose that we are considering a very small price change, and thus a small 
associated income change. Then the fractions in the two income effects will be 
virtually the same, since the rate of change of good 1 when income changes from 
m to m' should be about the same as when income changes from m to m". For 
such small changes we can collect terms and write the last two terms—the income 
effects—as 


Ax? 

Am 


(wi - Xl), 


which yields a Slutsky equation of the same form as that derived earlier: 


Axi Axf . . Ax? 

Api ~ Api + ^ ^ Am' 


If we want to express the Slutsky equation in calculus terms, we can just take 
limits in this expression. Or, if you prefer, we can calculate the correct equation 
directly, just by taking partial derivatives. Let xi(pi, m(pi)) be the demand 
function for good 1 where we hold price 2 fixed and recognize that money income 
depends on the price of good 1 via the relationship m(pi) = piuJi + P 2 U> 2 - Then 
we can write 


dxi(pi,m(pi)) _ dx 1 (p 1 ,m) dx 1 (p 1 ,m) dm(p?) 


(9.5) 


dpi dpi dm dpi 

By the definition of m(p i) we know how income changes when price changes: 


dm(pi) 

dpi 


= Wl, 


(9.6) 


and by the Slutsky equation we know how demand changes when price changes, 
holding money income fixed: 


dxi(pi,m) 

dpi 


dxl(pi) _ dx(pi,m) 
dpi dm Xl 


(9.7) 


Inserting equations (9.6) and (9.7) into equation (9.5) we have 


dxi(pi,m(pi)) 

dpi 


dxjjpi) 

dpi 


+ 


dx(pi,m) 

dm 


(wi - Xl), 


which is the form of the Slutsky equation that we want. 














CHAPTER 



INTERTEMPORAL 

CHOICE 


In this chapter we continue our examination of consumer behavior by con¬ 
sidering the choices involved in saving and consuming over time. Choices 
of consumption over time are known as intertemporal choices. 


10.1 The Budget Constraint 


Let us imagine a consumer who chooses how much of some good to consume 
in each of two time periods. We will usually want to think of this good 
as being a composite good, as described in Chapter 2, but you can think 
of it as being a specific commodity if you wish. We denote the amount 
of consumption in each period by ( 01 , 02 ) and suppose that the prices of 
consumption in each period are constant at 1. The amount of money the 
consumer will have in each period is denoted by ( 7711 , 7712 ). 

Suppose initially that the only way the consumer has of transferring 
money from period 1 to period 2 is by saving it without earning interest. 
Furthermore let us assume for the moment that he has no possibility of 
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Budget constraint. This is the budget constraint when the 
rate of interest is zero and no borrowing is allowed. The less 
the individual consumes in period 1, the more he can consume 
in period 2. 


borrowing money, so that the most he can spend in period 1 is m\. His 
budget constraint will then look like the one depicted in Figure 10.1. 

We see that there will be two possible kinds of choices. The consumer 
could choose to consume at (mi, m 2 ), which means that he just consumes 
his income each period, or he can choose to consume less than his income 
during the first period. In this latter case, the consumer is saving some of 
his first-period consumption for a later date. 

Now, let us allow the consumer to borrow and lend money at some 
interest rate r. Keeping the prices of consumption in each period at 1 for 
convenience, let us derive the budget constraint. Suppose first that the 
consumer decides to be a saver so his first period consumption, c\, is less 
than his first-period income, m\. In this case he will earn interest on the 
amount he saves, mi — ci, at the interest rate r. The amount that he can 
consume next period is given by 


c 2 = m 2 + (mi — Ci) + r(mi — ci) 

= m 2 + (1 + r)(mi - ci). (10.1) 

This says that the amount that the consumer can consume in period 2 is 
his income plus the amount he saved from period 1, plus the interest that 
he earned on his savings. 

Now suppose that the consumer is a borrower so that his first-period 
consumption is greater than his first-period income. The consumer is a 


Figure 

10.1 
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borrower if Ci > m\, and the interest he has to pay in the second period 
will be r(ci — m i). Of course, he also has to pay back the amount that he 
borrowed, Ci — m\. This means his budget constraint is given by 

C 2 = m 2 — r(ci — mi) — (d — mi) 

= rn 2 + (1 + r)(mi — ci), 

which is just what we had before. If mi — Ci is positive, then the consumer 
earns interest on this savings; if mi — Ci is negative, then the consumer 
pays interest on his borrowings. 

If ci = mi, then necessarily C 2 = m 2 , and the consumer is neither a 
borrower nor a lender. We might say that this consumption position is the 
“Polonius point.” 1 

We can rearrange the budget constraint for the consumer to get two 
alternative forms that are useful: 


(1 + r)ci + c 2 = (1 + r)mi + m 2 


and 

C 2 , rn 2 

Cl + TV =mi + ——. 
1 + r 1 + r 

Note that both equations have the form 


( 10 . 2 ) 

(10.3) 


P 1 X 1 + p 2 x 2 = pimi + p 2 m 2 . 

In equation (10.2), p\ = 1 + r and p 2 = 1. In equation (10.3), pi = 1 and 
P 2 = 1/(1 + r). 

We say that equation (10.2) expresses the budget constraint in terms of 
future value and that equation (10.3) expresses the budget constraint in 
terms of present value. The reason for this terminology is that the first 
budget constraint makes the price of future consumption equal to 1, while 
the second budget constraint makes the price of present consumption equal 
to 1. The first budget constraint measures the period-1 price relative to 
the period-2 price, while the second equation does the reverse. 

The geometric interpretation of present value and future value is given in 
Figure 10.2. The present value of an endowment of money in two periods is 
the amount of money in period 1 that would generate the same budget set 
as the endowment. This is just the horizontal intercept of the budget line, 
which gives the maximum amount of first-period consumption possible. 


1 “Neither a borrower, nor a lender be; For loan oft loses both itself and friend, And 
borrowing dulls the edge of husbandry.” Hamlet , Act I, scene m; Polonius giving 
advice to his son. 
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m, m, + /77 2 /(1 + r) Q 

(present value) 


Present and future values. The vertical intercept of the 
budget line measures future value, and the horizontal intercept 
measures the present value. 


Examining the budget constraint, this amount is c\ = m\ + 7712/(1 + r), 
which is the present value of the endowment. 

Similarly, the vertical intercept is the maximum amount of second-period 
consumption, which occurs when c\ = 0. Again, from the budget con¬ 
straint, we can solve for this amount C 2 = (1 + r)mi + 7712 , the future value 
of the endowment. 

The present-value form is the more important way to express the in¬ 
tertemporal budget constraint since it measures the future relative to the 
present, which is the way we naturally look at it. 

It is easy from any of these equations to see the form of this budget 
constraint. The budget line passes through ( 7711 , 7772 ), since that is always 
an affordable consumption pattern, and the budget line has a slope of 
-(1 + r). 


10.2 Preferences for Consumption 

Let us now consider the consumer’s preferences, as represented by his in¬ 
difference curves. The shape of the indifference curves indicates the con¬ 
sumer’s tastes for consumption at different times. If we drew indifference 
curves with a constant slope of —1, for example, they would represent tastes 
of a consumer who didn’t care whether he consumed today or tomorrow. 
His marginal rate of substitution between today and tomorrow is —1. 


Figure 

10.2 
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Figure 

10.3 


If we drew indifference curves for perfect complements, this would in¬ 
dicate that the consumer wanted to consume equal amounts today and 
tomorrow. Such a consumer would be unwilling to substitute consumption 
from one time period to the other, no matter what it might be worth to 
him to do so. 

As usual, the intermediate case of well-behaved preferences is the more 
reasonable situation. The consumer is willing to substitute some amount of 
consumption today for consumption tomorrow, and how much he is willing 
to substitute depends on the particular pattern of consumption that he 
has. 

Convexity of preferences is very natural in this context, since it says that 
the consumer would rather have an “average” amount of consumption each 
period rather than have a lot today and nothing tomorrow or vice versa. 


10.3 Comparative Statics 

Given a consumer’s budget constraint and his preferences for consumption 
in each of the two periods, we can examine the optimal choice of consump¬ 
tion ( 01 , 02 ). If the consumer chooses a point where ci < mi, we will say 
that she is a lender, and if Ci > mi, we say that she is a borrower. In 
Figure 10.3A we have depicted a case where the consumer is a borrower, 
and in Figure 10.3B we have depicted a lender. 




A Borrower B Lender 


Borrower and lender. Panel A depicts a borrower, since 
Ci > mi, and panel B depicts a lender, since Ci < mi- 


Let us now consider how the consumer would react to a change in the 
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interest rate. From equation (10.1) we see that increasing the rate of inter¬ 
est must tilt the budget line to a steeper position: for a given reduction in 
Ci you will get more consumption in the second period if the interest rate 
is higher. Of course the endowment always remains affordable, so the tilt 
is really a pivot around the endowment. 

We can also say something about how the choice of being a borrower 
or a lender changes as the interest rate changes. There are two cases, 
depending on whether the consumer is initially a borrower or initially a 
lender. Suppose first that he is a lender. Then it turns out that if the 
interest rate increases, the consumer must remain a lender. 

This argument is illustrated in Figure 10.4. If the consumer is initially a 
lender, then his consumption bundle is to the left of the endowment point. 
Now let the interest rate increase. Is it possible that the consumer shifts 
to a new consumption point to the right of the endowment? 

No, because that would violate the principle of revealed preference: 
choices to the right of the endowment point were available to the con¬ 
sumer when he faced the original budget set and were rejected in favor of 
the chosen point. Since the original optimal bundle is still available at the 
new budget line, the new optimal bundle must be a point outside the old 
budget set—which means it must be to the left of the endowment. The 
consumer must remain a lender when the interest rate increases. 

There is a similar effect for borrowers: if the consumer is initially a 
borrower, and the interest rate declines, he or she will remain a borrower. 
(You might sketch a diagram similar to Figure 10.4 and see if you can spell 
out the argument.) 

Thus if a person is a lender and the interest rate increases, he will remain 
a lender. If a person is a borrower and the interest rate decreases, he will 
remain a borrower. On the other hand, if a person is a lender and the 
interest rate decreases, he may well decide to switch to being a borrower; 
similarly, an increase in the interest rate may induce a borrower to become 
a lender. Revealed preference tells us nothing about these last two cases. 

Revealed preference can also be used to make judgments about how the 
consumer’s welfare changes as the interest rate changes. If the consumer 
is initially a borrower, and the interest rate rises, but he decides to remain 
a borrower, then he must be worse off at the new interest rate. This argu¬ 
ment is illustrated in Figure 10.5; if the consumer remains a borrower, he 
must be operating at a point that was affordable under the old budget set 
but was rejected, which implies that he must be worse off. 


10.4 The Slutsky Equation and Intertemporal Choice 


The Slutsky equation can be used to decompose the change in demand due 
to an interest rate change into income effects and substitution effects, just 


188 INTERTEMPORAL CHOICE (Ch. 10) 


Figure 

10.4 



m, Cj 


If a person is a lender and the interest rate rises, he or 
she will remain a lender. Increasing the interest rate pivots 
the budget line around the endowment to a steeper position; 
revealed preference implies that the new consumption bundle 
must lie to the left of the endowment. 


as in Chapter 9. Suppose that the interest rate rises. What will be the 
effect on consumption in each period? 

This is a case that is easier to analyze by using the future-value budget 
constraint, rather than the present-value constraint. In terms of the future- 
value budget constraint, raising the interest rate is just like raising the price 
of consumption today as compared to consumption tomorrow. Writing out 
the Slutsky equation we have 

^ Ac s ! Ac? 

+ ( TO i - d) “Srn ' 

(?) (-) ( ? ) (+) 

The substitution effect, as always, works opposite the direction of price. 
In this case the price of period-1 consumption goes up, so the substitution 
effect says the consumer should consume less first period. This is the 
meaning of the minus sign under the substitution effect. Let’s assume that 
consumption this period is a normal good, so that the very last term—how 
consumption changes as income changes—will be positive. So we put a 
plus sign under the last term. Now the sign of the whole expression will 
depend on the sign of (mi — ci). If the person is a borrower, this term 
will be negative and the whole expression will therefore unambiguously be 
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New 
consumption 



A borrower is made worse off by an increase in the inter¬ 
est rate. When the interest rate facing a borrower increases 
and the consumer chooses to remain a borrower, he or she is 
certainly worse off. 


Figure 

10.5 


negative—for a borrower, an increase in the interest rate must lower today’s 
consumption. 

Why does this happen? When the interest rate rises, there is always 
a substitution effect towards consuming less today. For a borrower, an 
increase in the interest rate means that he will have to pay more interest 
tomorrow. This effect induces him to borrow less, and thus consume less, 
in the first period. 

For a lender the effect is ambiguous. The total effect is the sum of a neg¬ 
ative substitution effect and a positive income effect. From the viewpoint 
of a lender an increase in the interest rate may give him so much extra 
income that he will want to consume even more first period. 

The effects of changing interest rates are not terribly mysterious. There 
is an income effect and a substitution effect as in any other price change. 
But without a tool like the Slutsky equation to separate out the various 
effects, the changes may be hard to disentangle. With such a tool, the 
sorting out of the effects is quite straightforward. 


10.5 Inflation 


The above analysis has all been conducted in terms of a general “consump- 
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tion” good. Giving up Ac units of consumption today buys you (1 + r)Ac 
units of consumption tomorrow. Implicit in this analysis is the assumption 
that the “price” of consumption doesn’t change—there is no inflation or 
deflation. 

However, the analysis is not hard to modify to deal with the case of infla¬ 
tion. Let us suppose that the consumption good now has a different price 
in each period. It is convenient to choose today’s price of consumption as 
1 and to let P 2 be the price of consumption tomorrow. It is also convenient 
to think of the endowment as being measured in units of the consumption 
goods as well, so that the monetary value of the endowment in period 2 is 
P 2 W 2 . Then the amount of money the consumer can spend in the second 
period is given by 


P2C2 = P 2 TO 2 + (1 + i~)(toi - ci), 


and the amount of consumption available second period is 


c 2 = m 2 


1 — /■ 

-( m i - Ci). 

P2 


Note that this equation is very similar to the equation given earlier —we 
just use (1 + r)/p 2 rather than 1 + r. 

Let us express this budget constraint in terms of the rate of inflation. 
The inflation rate, w, is just the rate at which prices grow. Recalling that 
Pi = 1, we have 

P2 = 1 + 7T, 

which gives us 

1 -| - T 

02 = 1712 + — (TO 1 - Ci). 

1 + 7T 

Let’s create a new variable /?, the real interest rate, and define it by 2 


1 + P 


1 + r 

1 + 7T 


so that the budget constraint becomes 


C 2 — to 2 T (1 T p)(toi — Ci). 


One plus the real interest rate measures how much extra consumption you 
can get in period 2 if you give up some consumption in period 1. That 
is why it is called the real rate of interest: it tells you how much extra 
consumption you can get, not how many extra dollars you can get. 


2 


The Greek letter p, rho, is pronounced “row. : 
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The interest rate on dollars is called the nominal rate of interest. As 
we’ve seen above, the relationship between the two is given by 


1 + P 


1 + r 

1 + 7T 


In order to get an explicit expression for p, we write this equation as 
1 + r 1 + r 1 + 7T 

p = -1 =- 

1 + 7T 1 + 7T 1 + 7T 

r — 7r 
1 + 7T 


This is an exact expression for the real interest rate, but it is common to 
use an approximation. If the inflation rate isn’t too large, the denominator 
of the fraction will be only slightly larger than 1. Thus the real rate of 
interest will be approximately given by 


p « r — 7r, 

which says that the real rate of interest is just the nominal rate minus the 
rate of inflation. (The symbol ss means “approximately equal to.”) This 
makes perfectly good sense: if the interest rate is 18 percent, but prices 
are rising at 10 percent, then the real interest rate—the extra consumption 
you can buy next period if you give up some consumption now—will be 
roughly 8 percent. 

Of course, we are always looking into the future when making consump¬ 
tion plans. Typically, we know the nominal rate of interest for the next 
period, but the rate of inflation for next period is unknown. The real inter¬ 
est rate is usually taken to be the current interest rate minus the expected 
rate of inflation. To the extent that people have different estimates about 
what the next year’s rate of inflation will be, they will have different esti¬ 
mates of the real interest rate. If inflation can be reasonably well forecast, 
these differences may not be too large. 


10.6 Present Value: A Closer Look 


Let us return now to the two forms of the budget constraint described 
earlier in section 10.1 in equations (10.2) and (10.3): 

(1 + r)ci + c 2 = (1 + r)mi + m 2 


and 


Cl + 


_C2_ 

1 + r 


m 2 


= m i + 


1 + r 
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Consider just the right-hand sides of these two equations. We said that 
the first one expresses the value of the endowment in terms of future value 
and that the second one expresses it in terms of present value. 

Let us examine the concept of future value first. If we can borrow and 
lend at an interest rate of r, what is the future equivalent of $1 today? 
The answer is (1 + r) dollars. That is, $1 today can be turned into (1 + r) 
dollars next period simply by lending it to the bank at an interest rate r. 
In other words, (1 + r) dollars next period is equivalent to $1 today since 
that is how much you would have to pay next period to purchase—that is, 
borrow—$1 today. The value (1 + r) is just the price of $1 today, relative 
to $1 next period. This can be easily seen from the first budget constraint: 
it is expressed in terms of future dollars—the second-period dollars have a 
price of 1, and first-period dollars are measured relative to them. 

What about present value? This is just the reverse: everything is mea¬ 
sured in terms of today’s dollars. How much is a dollar next period worth 
in terms of a dollar today? The answer is 1/(1-l-r) dollars. This is because 
1/(1 + r) dollars can be turned into a dollar next period simply by saving 
it at the rate of interest r. The present value of a dollar to be delivered 
next period is 1/(1 + r). 

The concept of present value gives us another way to express the budget 
for a two-period consumption problem: a consumption plan is affordable if 
the present value of consumption equals the present value of income. 

The idea of present value has an important implication that is closely 
related to a point made in Chapter 9: if the consumer can freely buy and sell 
goods at constant prices, then the consumer would always prefer a higher¬ 
valued endowment to a lower-valued one. In the case of intertemporal 
decisions, this principle implies that if a consumer can freely borrow and 
lend at a constant interest rate, then the consumer would always prefer a 
pattern of income with a higher present value to a pattern with a lower 
present value. 

This is true for the same reason that the statement in Chapter 9 was 
true: an endowment with a higher value gives rise to a budget line that is 
farther out. The new budget set contains the old budget set, which means 
that the consumer would have all the consumption opportunities she had 
with the old budget set plus some more. Economists sometimes say that 
an endowment with a higher present value dominates one with a lower 
present value in the sense that the consumer can have larger consumption 
in every period by selling the endowment with the higher present value 
that she could get by selling the endowment with the lower present value. 

Of course, if the present value of one endowment is higher than another, 
then the future value will be higher as well. However, it turns out that the 
present value is a more convenient way to measure the purchasing power 
of an endowment of money over time, and it is the measure to which we 
will devote the most attention. 
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10.7 Analyzing Present Value for Several Periods 

Let us consider a three-period model. We suppose that we can borrow or 
lend money at an interest rate r each period and that this interest rate will 
remain constant over the three periods. Thus the price of consumption in 
period 2 in terms of period-1 consumption will be 1/(1+ r), just as before. 

What will the price of period-3 consumption be? Well, if I invest $1 
today, it will grow into (1 + r) dollars next period; and if I leave this money 
invested, it will grow into (1 + r) 2 dollars by the third period. Thus if I 
start with 1/(1 + r) 2 dollars today, I can turn this into $1 in period 3. The 
price of period-3 consumption relative to period-1 consumption is therefore 
1/(1 + r) 2 . Each extra dollar’s worth of consumption in period 3 costs me 
1/(1 + r) 2 dollars today. This implies that the budget constraint will have 
the form 


, c 2 c 3 _ m 2 m 3 

Cl + --1- -7- -T7T — Wl + --b JZ -T7T. 

1 + r (1 + r) 2 1 + r (1 + r) 2 

This is just like the budget constraints we’ve seen before, where the price 
of period-f consumption in terms of today’s consumption is given by 

1 

Pt ~ (l + r)*" 1 ' 

As before, moving to an endowment that has a higher present value at 
these prices will be preferred by any consumer, since such a change will 
necessarily shift the budget set farther out. 

We have derived this budget constraint under the assumption of constant 
interest rates, but it is easy to generalize to the case of changing interest 
rates. Suppose, for example, that the interest earned on savings from period 
1 to 2 is ri, while savings from period 2 to 3 earn r 2 . Then $1 in period 1 
will grow to (1 + j"i)(l + r 2 ) dollars in period 3. The present value of $1 in 
period 3 is therefore 1/(1 + ri)(l + r 2 ). This implies that the correct form 
of the budget constraint is 


c 2 C 3 m 2 m 3 

Cl + 1 + ri + (1 + ri)(l + r 2 ) mi + 1 + ry + (1 + n)(l + r 2 )' 

This expression is not so hard to deal with, but we will typically be content 
to examine the case of constant interest rates. 

Table 10.1 contains some examples of the present value of $1 T years in 
the future at different interest rates. The notable fact about this table is 
how quickly the present value goes down for “reasonable” interest rates. 
For example, at an interest rate of 10 percent, the value of $1 20 years from 
now is only 15 cents. 
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Table The present value of $1 t years in the future. 

10.1 


Rate 

1 

2 

5 

10 

15 

20 

25 

30 

.05 

.95 

.91 

.78 

.61 

.48 

.37 

.30 

.23 

.10 

.91 

.83 

.62 

.39 

.24 

.15 

.09 

.06 

.15 

.87 

.76 

.50 

.25 

.12 

.06 

.03 

.02 

.20 

.83 

.69 

.40 

.16 

.06 

.03 

.01 

.00 


10.8 Use of Present Value 

Let us start by stating an important general principle: present value is the 
only correct way to convert a stream of payments into today’s dollars. This 
principle follows directly from the definition of present value: the present 
value measures the value of a consumer’s endowment of money. As long as 
the consumer can borrow and lend freely at a constant interest rate, an en¬ 
dowment with higher present value can always generate more consumption 
in every period than an endowment with lower present value. Regardless 
of your own tastes for consumption in different periods, you should always 
prefer a stream of money that has a higher present value to one with lower 
present value—since that always gives you more consumption possibilities 
in every period. 

This argument is illustrated in Figure 10.6. In this figure, (rn\,m!^) 
is a worse consumption bundle than the consumer’s original endowment, 
(mi, m 2 ), since it lies beneath the indifference curve through her endow¬ 
ment. Nevertheless, the consumer would prefer (m'i,m' 2 ) to (mi,m 2 ) if 
she is able to borrow and lend at the interest rate r. This is true because 
with the endowment (m^mlf) she can afford to consume a bundle such 
as ( 01 , 02 ), which is unambiguously better than her current consumption 
bundle. 

One very useful application of present value is in valuing the income 
streams offered by different kinds of investments. If you want to compare 
two different investments that yield different streams of payments to see 
which is better, you simply compute the two present values and choose the 
larger one. The investment with the larger present value always gives you 
more consumption possibilities. 

Sometimes it is necessary to purchase an income stream by making a 
stream of payments over time. For example, one could purchase an apart¬ 
ment building by borrowing money from a bank and making mortgage pay¬ 
ments over a number of years. Suppose that the income stream (Mi, M 2 ) 
can be purchased by making a stream of payments (Pi, P 2 ). 

In this case we can evaluate the investment by comparing the present 
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™i nn\ C, 


Higher present value. An endowment with higher present 
value gives the consumer more consumption possibilities in each 
period if she can borrow and lend at the market interest rates. 


Figure 

10.6 


value of the income stream to the present value of the payment stream. If 


Mi + 


M 2 
1 + r 


> Pi + 


Pi 

1 + r ’ 


(10.4) 


the present value of the income stream exceeds the present value of its 
cost, so this is a good investment—it will increase the present value of our 
endowment. 

An equivalent way to value the investment is to use the idea of net 
present value. In order to calculate this number we calculate at the net 
cash flow in each period and then discount this stream back to the present. 
In this example, the net cash flow is (Mi — Pi, M 2 — Pi), and the net present 
value is 

M 2 - P 2 

NPV = Mi - Pi + —- 

1 + r 

Comparing this to equation (10.4) we see that the investment should be 
purchased if and only if the net present value is positive. 

The net present value calculation is very convenient since it allows us to 
add all of the positive and negative cash flows together in each period and 
then discount the resulting stream of cash flows. 


EXAMPLE: Valuing a Stream of Payments 

Suppose that we are considering two investments, A and B. Investment A 
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pays $100 now and will also pay $200 next year. Investment B pays $0 
now, and will generate $310 next year. Which is the better investment? 

The answer depends on the interest rate. If the interest rate is zero, the 
answer is clear- just add up the payments. For if the interest rate is zero, 
then the present-value calculation boils down to summing up the payments. 

If the interest rate is zero, the present value of investment A is 

PV A = 100 + 200 = 300, 
and the present value of investment B is 

PV B = 0 + 310 = 310, 
so B is the preferred investment. 

But we get the opposite answer if the interest rate is high enough. Sup¬ 
pose, for example, that the interest rate is 20 percent. Then the present- 
value calculation becomes 


PV A = 100 + — = 266.67 
310 

PVb = 0 + — = 258.33. 

Now A is the better investment. The fact that A pays back more money 
earlier means that it will have a higher present value when the interest rate 
is large enough. 


EXAMPLE: The True Cost of a Credit Card 

Borrowing money on a credit card is expensive: many companies quote 
yearly interest charges of 15 to 21 percent. However, because of the way 
these finance charges are computed, the true interest rate on credit card 
debt is much higher than this. 

Suppose that a credit card owner charges a $2000 purchase on the first 
day of the month and that the finance charge is 1.5 percent a month. If 
the consumer pays the entire balance by the end of the month, he does not 
have to pay the finance charge. If the consumer pays none of the $2,000, 
he has to pay a finance charge of $2000 x .015 = $30 at the beginning of 
the next month. 

What happens if the consumer pays $1,800 towards the $2000 balance 
on the last day of the month? In this case, the consumer has borrowed 
only $200, so the finance charge should be $3. However, many credit card 
companies charge the consumers much more than this. The reason is that 
many companies base their charges on the “average monthly balance,” even 
if part of that balance is paid by the end of the month. In this example, 
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the average monthly balance would be about $2000 (30 days of the $2000 
balance and 1 day of the $200 balance). The finance charge would therefore 
be slightly less than $30, even though the consumer has only borrowed $200. 
Based on the actual amount of money borrowed, this is an interest rate of 
15 percent a month! 


EXAMPLE: Extending Copyright 

Article I, Section 8 of the U.S. Constitution enables Congress to grant 
patents and copyrights using this language: “To promote the Progress 
of Science and useful Arts, by securing for limited Times to Authors and 
Inventors the exclusive Right to their respective Writings and Discoveries.” 

But what does “limited Times” mean? The lifetime of a patent in the 
United States is fixed at 20 years; the lifetime for copyright is quite differ¬ 
ent. 

The first copyright act, passed by Congress in 1790, offered a 14-year 
term along with a 14-year renewal. Subsequently, the copyright term was 
lengthened to 28 years in 1831, with a 28-year renewal option added in 
1909. In 1962 the term became 47 years, and 67 years in 1978. In 1967 
the term was defined as the life of the author plus 50 years, or 75 years 
for “works for hire.” The 1998 Sonny Bono Copyright Term Extension Act 
lengthened this term to the life of the author plus 70 years for individuals 
and 75-95 years for works for hire. 

It is questionable whether “the life of the author plus 70 years” should 
be considered a limited time. One might ask what additional incentive the 
1998 extension creates for authors to create works? 

Let us look at a simple example. Suppose that the interest rate is 7%. 
Then the increase in present value of extending the copyright term from 
80 to 100 years is about 0.33% of the present value of the first 80 years. 
Those extra 20 years have almost no impact on the present value of the 
copyright at time of creation since they come so far in the future. Hence 
they likely provide miniscule incremental incentive to create the works in 
the first place. 

Given this tiny increase in value from extending the copyright term why 
would it pay anybody to lobby for such a change? The answer is that the 
1998 act extended the copyright term retroactively so that works that were 
near expiration were given a new lease on life. 

For example, it has been widely claimed that Disney lobbied heavily 
for the copyright term extension, since the original Mickey Mouse film, 
Steamboat Willie, was about to go out of copyright. 

Retroactive copyright extensions of this sort make no economic sense, 
since what matters for the authors are the incentives present at the time 
the work is created. If there were no such retroactive extension, it is unlikely 
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that anyone would have bothered to ask for copyright extensions given the 
low economic value of the additional years of protection. 


10.9 Bonds 

Securities are financial instruments that promise certain patterns of pay¬ 
ment schedules. There are many kinds of financial instruments because 
there are many kinds of payment schedules that people want. Financial 
markets give people the opportunity to trade different patterns of cash 
flows over time. These cash flows are typically used to finance consump¬ 
tion at some time or other. 

The particular kind of security that we will examine here is a bond. 
Bonds are issued by governments and corporations. They are basically a 
way to borrow money. The borrower—the agent who issues the bond- 
promises to pay a fixed number of dollars x (the coupon) each period 
until a certain date T (the maturity date), at which point the borrower 
will pay an amount F (the face value) to the holder of the bond. 

Thus the payment stream of a bond looks like (x, x, x ,..., F). If the 
interest rate is constant, the present discounted value of such a bond is 
easy to compute. It is given by 


PV = 


(1 + r) (1 + r) 2 


(1 + r) q 


Note that the present value of a bond will decline if the interest rate 
increases. Why is this? When the interest rate goes up the price now for 
$1 delivered in the future goes down. So the future payments of the bond 
will be worth less now. 

There is a large and developed market for bonds. The market value 
of outstanding bonds will fluctuate as the interest rate fluctuates since 
the present value of the stream of payments represented by the bond will 
change. 

An interesting special kind of a bond is a bond that makes payments 
forever. These are called consols or perpetuities. Suppose that we con¬ 
sider a consol that promises to pay $x dollars a year forever. To compute 
the value of this consol we have to compute the infinite sum: 


PV = 


1 + r (1 + r) 2 


The trick to computing this is to factor out 1/(1 + r) to get 


PV = 


1 +r 


(1 + r) (1 + r) 2 
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But the term in the brackets is just x plus the present value! Substituting 
and solving for PV: 

pv = iih)'* +pv ' 

x 

r 

This wasn’t hard to do, but there is an easy way to get the answer right 
off. How much money, V, would you need at an interest rate r to get x 
dollars forever? Just write down the equation 

Vr = x , 

which says that the interest on V must equal x. But then the value of such 
an investment is given by 



Thus it must be that the present value of a consol that promises to pay x 
dollars forever must be given by x/r. 

For a consol it is easy to see directly how increasing the interest rate 
reduces the value of a bond. Suppose, for example, that a consol is issued 
when the interest rate is 10 percent. Then if it promises to pay $10 a year 
forever, it will be worth $100 now—since $100 would generate $10 a year 
in interest income. 

Now suppose that the interest rate goes up to 20 percent. The value of 
the consol must fall to $50, since it only takes $50 to earn $10 a year at a 
20 percent interest rate. 

The formula for the consol can be used to calculate an approximate value 
of a long-term bond. If the interest rate is 10 percent, for example, the 
value of $1 30 years from now is only 6 cents. For the size of interest rates 
we usually encounter, 30 years might as well be infinity. 

EXAMPLE: Installment Loans 

Suppose that you borrow $1000 that you promise to pay back in 12 monthly 
installments of $100 each. What rate of interest are you paying? 

At first glance it seems that your interest rate is 20 percent: you have 
borrowed $1000, and you are paying back $1200. But this analysis is incor¬ 
rect. For you haven’t really borrowed $1000 for an entire year. You have 
borrowed $1000 for a month, and then you pay back $100. Then you only 
have borrowed $900, and you owe only a month’s interest on the $900. You 
borrow that for a month and then pay back another $100. And so on. 

The stream of payments that we want to value is 

( 1000 ,- 100 ,- 100 ,...,- 100 ). 

We can find the interest rate that makes the present value of this stream 
equal to zero by using a calculator or a computer. The actual interest rate 
that you are paying on the installment loan is about 35 percent! 
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10.10 Taxes 

In the United States, interest payments are taxed as ordinary income. This 
means that you pay the same tax on interest income as on labor income. 
Suppose that your marginal tax bracket is t , so that each extra dollar of 
income, Am, increases your tax liability by tAm. Then if you invest X 
dollars in an asset, you’ll receive an interest payment of rX. But you’ll 
also have to pay taxes of trX on this income, which will leave you with 
only (1 — t)rX dollars of after-tax income. We call the rate (1 — t)r the 
after-tax interest rate. 

What if you decide to borrow X dollars, rather than lend them? Then 
you’ll have to make an interest payment of rX. In the United States, some 
interest payments are tax deductible and some are not. For example, the 
interest payments for a mortgage are tax deductable, but interest payments 
on ordinary consumer loans are not. On the other hand, businesses can 
deduct most kinds of the interest payments that they make. 

If a particular interest payment is tax deductible, you can subtract your 
interest payment from your other income and only pay taxes on what’s left. 
Thus the rX dollars you pay in interest will reduce your tax payments by 
trX. The total cost of the X dollars you borrowed will be rX — trX = 
(1 -t)rX. 

Thus the after-tax interest rate is the same whether you are borrowing 
or lending, for people in the same tax bracket. The tax on saving will 
reduce the amount of money that people want to save, but the subsidy on 
borrowing will increase the amount of money that people want to borrow. 


EXAMPLE: Scholarships and Savings 

Many students in the United States receive some form of financial aid to 
help defray college costs. The amount of financial aid a student receives 
depends on many factors, but one important factor is the family’s ability to 
pay for college expenses. Most U.S. colleges and universities use a standard 
measure of ability to pay calculated by the College Entrance Examination 
Board (CEEB). 

If a student wishes to apply for financial aid, his or her family must fill 
out a questionnaire describing their financial circumstances. The CEEB 
uses the information on the income and assets of the parents to construct 
a measure of “adjusted available income.” The fraction of their adjusted 
available income that parents are expected to contribute varies between 
22 and 47 percent, depending on income. In 1985, parents with a total 
before-tax income of around $35,000 dollars were expected to contribute 
about $7000 toward college expenses. 
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Each additional dollar of assets that the parents accumulate increases 
their expected contribution and decreases the amount of financial aid that 
their child can hope to receive. The formula used by the CEEB effectively 
imposes a tax on parents who save for their children’s college education. 
Martin Feldstein, President of the National Bureau of Economic Research 
(NBER) and Professor of Economics at Harvard University, calculated the 
magnitude of this tax. 3 

Consider the situation of some parents contemplating saving an addi¬ 
tional dollar just as their daughter enters college. At a 6 percent rate of 
interest, the future value of a dollar 4 years from now is $1.26. Since federal 
and state taxes must be paid on interest income, the dollar yields $1.19 in 
after-tax income in 4 years. However, since this additional dollar of savings 
increases the total assets of the parents, the amount of aid received by the 
daughter goes down during each of her four college years. The effect of this 
“education tax” is to reduce the future value of the dollar to only 87 cents 
after 4 years. This is equivalent to an income tax of 150 percent! 

Feldstein also examined the savings behavior of a sample of middle-class 
households with pre-college children. He estimates that a household with 
income of $40,000 a year and two college-age children saves about 50 per¬ 
cent less than they would otherwise due to the combination of federal, state, 
and “education” taxes that they face. 


10.11 Choice of the Interest Rate 

In the above discussion, we’ve talked about “the interest rate.” In real life 
there are many interest rates: there are nominal rates, real rates, before-tax 
rates, after-tax rates, short-term rates, long-term rates, and so on. Which 
is the “right” rate to use in doing present-value analysis? 

The way to answer this question is to think about the fundamentals. 
The idea of present discounted value arose because we wanted to be able 
to convert money at one point in time to an equivalent amount at another 
point in time. “The interest rate” is the return on an investment that 
allows us to transfer funds in this way. 

If we want to apply this analysis when there are a variety of interest 
rates available, we need to ask which one has the properties most like the 
stream of payments we are trying to value. If the stream of payments 
is not taxed, we should use an after-tax interest rate. If the stream of 
payments will continue for 30 years, we should use a long-term interest 
rate. If the stream of payments is risky, we should use the interest rate 
on an investment with similar risk characteristics. (We’ll have more to say 
later about what this last statement actually means.) 

3 Martin Feldstein, “College Scholarship Rules and Private Savings,” American Eco¬ 
nomic Review, 85, 3 (June 1995). 
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The interest rate measures the opportunity cost of funds—the value 
of alternative uses of your money. So every stream of payments should be 
compared to your best alternative that has similar characteristics in terms 
of tax treatment, risk, and liquidity. 


Summary 

1. The budget constraint for intertemporal consumption can be expressed 
in terms of present value or future value. 

2. The comparative statics results derived earlier for general choice prob¬ 
lems can be applied to intertemporal consumption as well. 

3. The real rate of interest measures the extra consumption that you can 
get in the future by giving up some consumption today. 

4. A consumer who can borrow and lend at a constant interest rate should 
always prefer an endowment with a higher present value to one with a lower 
present value. 


REVIEW QUESTIONS 


1. How much is $1 million to be delivered 20 years in the future worth 
today if the interest rate is 20 percent? 

2. As the interest rate rises, does the intertemporal budget constraint be¬ 
come steeper or flatter? 

3. Would the assumption that goods are perfect substitutes be valid in a 
study of intertemporal food purchases? 

4. A consumer, who is initially a lender, remains a lender even after a 
decline in interest rates. Is this consumer better off or worse off after the 
change in interest rates? If the consumer becomes a borrower after the 
change is he better off or worse off? 

5. What is the present value of $100 one year from now if the interest rate 
is 10%? What is the present value if the interest rate is 5%? 


CHAPTER 



ASSET 

MARKETS 


Assets are goods that provide a flow of services over time. Assets can 
provide a flow of consumption services, like housing services, or can provide 
a flow of money that can be used to purchase consumption. Assets that 
provide a monetary flow are called financial assets. 

The bonds that we discussed in the last chapter are examples of financial 
assets. The flow of services they provide is the flow of interest payments. 
Other sorts of financial assets such as corporate stock provide different 
patterns of cash flows. In this chapter we will examine the functioning of 
asset markets under conditions of complete certainty about the future flow 
of services provided by the asset. 


11.1 Rates of Return 

Under this admittedly extreme hypothesis, we have a simple principle re¬ 
lating asset rates of return: if there is no uncertainty about the cash flow 
provided by assets, then all assets have to have the same rate of return. 
The reason is obvious: if one asset had a higher rate of return than another, 
and both assets were otherwise identical, then no one would want to buy 
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the asset with the lower rate of return. So in equilibrium, all assets that 
are actually held must pay the same rate of return. 

Let us consider the process by which these rates of return adjust. Con¬ 
sider an asset A that has current price po and is expected to have a price of 
Pi tomorrow. Everyone is certain about what today’s price of the asset is, 
and everyone is certain about what tomorrow’s price will be. We suppose 
for simplicity that there are no dividends or other cash payments between 
periods 0 and 1. Suppose furthermore that there is another investment, B, 
that one can hold between periods 0 and 1 that will pay an interest rate of 
r. Now consider two possible investment plans: either invest one dollar in 
asset A and cash it in next period, or invest one dollar in asset B and earn 
interest of r dollars over the period. 

What are the values of these two investment plans at the end of the first 
period? We first ask how many units of the asset we must purchase to 
make a one dollar investment in it. Letting x be this amount we have the 
equation 

p 0 x = 1 


- 

Po 

It follows that the future value of one dollar’s worth of this asset next 
period will be 

FV = pix = 

Po 

On the other hand, if we invest one dollar in asset B, we will have 1 + r 
dollars next period. If assets A and B are both held in equilibrium, then 
a dollar invested in either one of them must be worth the same amount 
second period. Thus we have an equilibrium condition: 


1 + r = 


Pi 

Po' 


What happens if this equality is not satisfied? Then there is a sure way 
to make money. For example, if 

1 + r > —, 

Po 

people who own asset A can sell one unit for po dollars in the first period 
and invest the money in asset B. Next period their investment in asset B 
will be worth po(l + t), which is greater than p\ by the above equation. 
This will guarantee that second period they will have enough money to 
repurchase asset A, and be back where they started from, but now with 
extra money. 
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This kind of operation- buying some of one asset and selling some of 
another to realize a sure return—is known as riskless arbitrage, or ar¬ 
bitrage for short. As long as there are people around looking for “sure 
things” we would expect that well-functioning markets should quickly elim¬ 
inate any opportunities for arbitrage. Therefore, another way to state our 
equilibrium condition is to say that in equilibrium there should be no oppor¬ 
tunities for arbitrage. We’ll refer to this as the no arbitrage condition. 

But how does arbitrage actually work to eliminate the inequality? In the 
example given above, we argued that if 1 + r > p\/po, then anyone who 
held asset A would want to sell it first period, since they were guaranteed 
enough money to repurchase it second period. But who would they sell it 
to? Who would want to buy it? There would be plenty of people willing 
to supply asset A at p 0 , but there wouldn’t be anyone foolish enough to 
demand it at that price. 

This means that supply would exceed demand and therefore the price 
will fall. How far will it fall? Just enough to satisfy the arbitrage condition: 
until 1 + r = p\ /po. 


11.2 Arbitrage and Present Value 

We can rewrite the arbitrage condition in a useful way by cross multiplying 
to get 


This says that the current price of an asset must be its present value. 
Essentially we have converted the future-value comparison in the arbitrage 
condition to a present-value comparison. So if the no arbitrage condition is 
satisfied, then we are assured that assets must sell for their present values. 
Any deviation from present-value pricing leaves a sure way to make money. 


11.3 Adjustments for Differences among Assets 

The no arbitrage rule assumes that the asset services provided by the two 
assets are identical, except for the purely monetary difference. If the ser¬ 
vices provided by the assets have different characteristics, then we would 
want to adjust for those differences before we blandly assert that the two 
assets must have the same equilibrium rate of return. 

For example, one asset might be easier to sell than the other. We some¬ 
times express this by saying that one asset is more liquid than another. 
In this case, we might want to adjust the rate of return to take account of 
the difficulty involved in finding a buyer for the asset. Thus a house that 
is worth $100,000 is probably a less liquid asset than $100,000 in Treasury 
bills. 
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Similarly, one asset might be riskier than another. The rate of return 
on one asset may be guaranteed, while the rate of return on another asset 
may be highly risky. We’ll examine ways to adjust for risk differences in 
Chapter 13. 

Here we want to consider two other types of adjustment we might make. 
One is adjustment for assets that have some return in consumption value, 
and the other is for assets that have different tax characteristics. 


11.4 Assets with Consumption Returns 

Many assets pay off only in money. But there are other assets that pay 
off in terms of consumption as well. The prime example of this is housing. 
If you own a house that you live in, then you don’t have to rent living 
quarters; thus part of the “return” to owning the house is the fact that you 
get to live in the house without paying rent. Or, put another way, you get 
to pay the rent for your house to yourself. This latter way of putting it 
sounds peculiar, but it contains an important insight. 

It is true that you don’t make an explicit rental payment to yourself for 
the privilege of living in your house, but it turns out to be fruitful to think 
of a homeowner as implicitly making such a payment. The implicit rental 
rate on your house is the rate at which you could rent a similar house. Or, 
equivalently, it is the rate at which you could rent your house to someone 
else on the open market. By choosing to “rent your house to yourself” you 
are forgoing the opportunity of earning rental payments from someone else, 
and thus incurring an opportunity cost. 

Suppose that the implicit rental payment on your house would work 
out to T dollars per year. Then part of the return to owning your house 
is the fact that it generates for you an implicit income of T dollars per 
year—the money that you would otherwise have to pay to live in the same 
circumstances as you do now. 

But that is not the entire return on your house. As real estate agents 
never tire of telling us, a house is also an investment. When you buy a house 
you pay a significant amount of money for it, and you might reasonably 
expect to earn a monetary return on this investment as well, through an 
increase in the value of your house. This increase in the value of an asset 
is known as appreciation. 

Let us use A to represent the expected appreciation in the dollar value 
of your house over a year. The total return to owning your house is the 
sum of the rental return, T, and the investment return, A. If your house 
initially cost P, then the total rate of return on your initial investment in 
housing is 

, T + A 
h = -p~ 

This total rate of return is composed of the consumption rate of return, 
T/P, and the investment rate of return, A/P. 
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Let us use r to represent the rate of return on other financial assets. 
Then the total rate of return on housing should, in equilibrium, be equal 
to r: 

T + A 


Think about it this way. At the beginning of the year, you can invest P 
in a bank and earn rP dollars, or you can invest P dollars in a house and 
save T dollars of rent and earn A dollars by the end of the year. The total 
return from these two investments has to be the same. If T + A < rP you 
would be better off investing your money in the bank and paying T dollars 
in rent. You would then have rP — T > A dollars at the end of the year. 
If T + A > rP , then housing would be the better choice. (Of course, this 
is ignoring the real estate agent’s commission and other transactions costs 
associated with the purchase and sale.) 

Since the total return should rise at the rate of interest, the financial 
rate of return A/P will generally be less than the rate of interest. Thus 
in general, assets that pay off in consumption will in equilibrium have a 
lower financial rate of return than purely financial assets. This means that 
buying consumption goods such as houses, or paintings, or jewelry solely 
as a financial investment is probably not a good idea since the rate of 
return on these assets will probably be lower than the rate of return on 
purely financial assets, because part of the price of the asset reflects the 
consumption return that people receive from owning such assets. On the 
other hand, if you place a sufficiently high value on the consumption return 
on such assets, or you can generate rental income from the assets, it may 
well make sense to buy them. The total return on such assets may well 
make this a sensible choice. 


11.5 Taxation of Asset Returns 

The Internal Revenue Service distinguishes two kinds of asset returns for 
purposes of taxation. The first kind is the dividend or interest return. 
These are returns that are paid periodically—each year or each month- 
over the life of the asset. You pay taxes on interest and dividend income at 
your ordinary tax rate, the same rate that you pay on your labor income. 

The second kind of returns are called capital gains. Capital gains occur 
when you sell an asset at a price higher than the price at which you bought 
it. Capital gains are taxed only when you actually sell the asset. Under 
the current tax law, capital gains are taxed at the same rate as ordinary 
income, but there are some proposals to tax them at a more favorable rate. 

It is sometimes argued that taxing capital gains at the same rate as 
ordinary income is a “neutral” policy. However, this claim can be disputed 
for at least two reasons. The first reason is that the capital gains taxes are 
only paid when the asset is sold, while taxes on dividends or interest are 
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paid every year. The fact that the capital gains taxes are deferred until 
time of sale makes the effective tax rate on capital gains lower than the 
tax rate on ordinary income. 

A second reason that equal taxation of capital gains and ordinary income 
is not neutral is that the capital gains tax is based on the increase in the 
dollar value of an asset. If asset values are increasing just because of 
inflation, then a consumer may owe taxes on an asset whose real value 
hasn’t changed. For example, suppose that a person buys an asset for $100 
and 10 years later it is worth $200. Suppose that the general price level 
also doubles in this same ten-year period. Then the person would owe 
taxes on a $100 capital gain even though the purchasing power of his asset 
hadn’t changed at all. This tends to make the tax on capital gains higher 
than that on ordinary income. Which of the two effects dominates is a 
controversial question. 

In addition to the differential taxation of dividends and capital gains 
there are many other aspects of the tax law that treat asset returns differ¬ 
ently. For example, in the United States, municipal bonds, bonds issued 
by cities or states, are not taxed by the Federal government. As we indi¬ 
cated earlier, the consumption returns from owner-occupied housing is not 
taxed. Furthermore, in the United States even part of the capital gains 
from owner-occupied housing is not taxed. 

The fact that different assets are taxed differently means that the arbi¬ 
trage rule must adjust for the tax differences in comparing rates of return. 
Suppose that one asset pays a before-tax interest rate, r b , and another as¬ 
set pays a return that is tax exempt, r e . Then if both assets are held by 
individuals who pay taxes on income at rate t, we must have 

(1 - t)r b = r e . 

That is, the after-tax return on each asset must be the same. Otherwise, 
individuals would not want to hold both assets—it would always pay them 
to switch exclusively to holding the asset that gave them the higher after¬ 
tax return. Of course, this discussion ignores other differences in the assets 
such as liquidity, risk, and so on. 


11.6 Market Bubbles 

Suppose you are contemplating buying a house that is absolutely certain 
to be worth $220,000 a year from now and that the current interest rate 
(reflecting your alternative investment opportunities) is 10%. A fair price 
for the house would be the present value, $200,000. 

Now suppose that things aren’t quite so certain: many people believe 
that the house will be worth $220,000 in a year, but there are no guarantees. 
We would expect that the house would sell for somewhat less than $200,000 
due to the additional risk associated with purchase. 
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Suppose the year goes by and the house is worth $240,000, far more than 
anticipated. The house value went up by 20%, even though the prevailing 
interest rate was 10%. It may be that this experience will lead people to 
revise their view about how much the house will be worth in the future— 
who knows, maybe it will go up by 20% or even more next year. 

If many people hold such beliefs, they can bid up the price of housing 
now—which may encourage others to make even more optimistic forecasts 
about the housing market. As in our discussion of price adjustment, assets 
that people expect to have a higher return than the rate of interest get 
pushed up in price. The higher price will tend to reduce current demand 
but it also may encourage people to expect an even higher return in the 
future. 

The first effect—high prices reducing demand—tends to stablize prices. 
The second effect—high prices leading to an expectation of even higher 
prices in the future—tends to destabilize prices. 

This is an example of an asset bubble. In a bubble, the price of an 
asset increases, for one reason or another, and this leads people to expect 
the price to go up even more in the future. But if they expect the asset 
price to rise significantly in the future, they will try to buy more today, 
pushing prices up even more rapidly. 

Financial markets may be subject to such bubbles, particularly when the 
participants are inexperienced. For example, in 2000-01 we saw a dramatic 
run-up in the prices of technology stocks and in 2005-06 we saw a bubble 
in house prices in much of the United States and many other countries. 

All bubbles eventually burst. Prices fall and some people are left holding 
assets that are worth much less than they paid for them. 

The key to avoiding bubbles is to look at economic fundamentals. In the 
midst of the housing bubble in the United States, the ratio between the 
price of a house and the yearly rental rate on an identical house became 
far larger than historical norms. This gap presumably reflected buyers’ 
expectations of future price increases. 

Similarly, the ratio of median house prices to median income reached 
historical highs. Both of these were warning signs that the high prices 
were unsustainable. 

“This time it’s different” can be a very hazardous belief to hold, partic¬ 
ularly when it comes to financial markets. 


11.7 Applications 


The fact that all riskless assets must earn the same return is obvious, but 
very important. It has surprisingly powerful implications for the function¬ 
ing of asset markets. 
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Depletable Resources 

Let us study the market equilibrium for a depletable resource like oil. Con¬ 
sider a competitive oil market, with many suppliers, and suppose for sim¬ 
plicity that there are zero costs to extract oil from the ground. Then how 
will the price of oil change over time? 

It turns out that the price of oil must rise at the rate of interest. To see 
this, simply note that oil in the ground is an asset like any other asset. If 
it is worthwhile for a producer to hold it from one period to the next, it 
must provide a return to him equivalent to the financial return he could 
get elsewhere. If we let p t +i and p t be the prices at times t + 1 and t, then 
we have 

Pt+i = (1 + r)p t 

as our no arbitrage condition in the oil market. 

The argument boils down to this simple idea: oil in the ground is like 
money in the bank. If money in the bank earns a rate of return of r, then 
oil in the ground must earn the same rate of return. If oil in the ground 
earned a higher return than money in the bank, then no one would take oil 
out of the ground, preferring to wait till later to extract it, thus pushing 
the current price of oil up. If oil in the ground earned a lower return than 
money in the bank, then the owners of oil wells would try to pump their oil 
out immediately in order to put the money in the bank, thereby depressing 
the current price of oil. 

This argument tells us how the price of oil changes. But what determines 
the price level itself? The price level turns out to be determined by the 
demand for oil. Let us consider a very simple model of the demand side of 
the market. 

Suppose that the demand for oil is constant at D barrels a year and 
that there is a total world supply of S barrels. Thus we have a total of 
T = S/D years of oil left. When the oil has been depleted we will have to 
use an alternative technology, say liquefied coal, which can be produced at 
a constant cost of C dollars per barrel. We suppose that liquefied coal is a 
perfect substitute for oil in all applications. 

Now, T years from now, when the oil is just being exhausted, how much 
must it sell for? Clearly it must sell for C dollars a barrel, the price of 
its perfect substitute, liquefied coal. This means that the price today of a 
barrel of oil, p$, must grow at the rate of interest r over the next T years 
to be equal to C. This gives us the equation 

p 0 {l + r) T = C 
C 

(1 + r) T ' 


Po = 
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This expression gives us the current price of oil as a function of the 
other variables in the problem. We can now ask interesting comparative 
statics questions. For example, what happens if there is an unforeseen new 
discovery of oil? This means that T, the number of years remaining of oil, 
will increase, and thus (1 + r) T will increase, thereby decreasing pq. So 
an increase in the supply of oil will, not surprisingly, decrease its current 
price. 

What if there is a technological breakthrough that decreases the value 
of Cl Then the above equation shows that po must decrease. The price 
of oil has to be equal to the price of its perfect substitute, liquefied coal, 
when liquefied coal is the only alternative. 


When to Cut a Forest 


Suppose that the size of a forest—measured in terms of the lumber that 
you can get from it—is some function of time, F(t). Suppose further that 
the price of lumber is constant and that the rate of growth of the tree starts 
high and gradually declines. If there is a competitive market for lumber, 
when should the forest be cut for timber? 

Answer: when the rate of growth of the forest equals the interest rate. 
Before that, the forest is earning a higher rate of return than money in the 
bank, and after that point it is earning less than money in the bank. The 
optimal time to cut a forest is when its growth rate just equals the interest 
rate. 

We can express this more formally by looking at the present value of 
cutting the forest at time T. This will be 


PV = 


F(T) 

(1 + r) T ' 


We want to find the choice of T that maximizes the present value—that 
is, that makes the value of the forest as large as possible. If we choose 
a very small value of T, the rate of growth of the forest will exceed the 
interest rate, which means that the PV would be increasing so it would 
pay to wait a little longer. On the other hand, if we consider a very large 
value of T, the forest would be growing more slowly than the interest rate, 
so the PV would be decreasing. The choice of T that maximizes present 
value occurs when the rate of growth of the forest just equals the interest 
rate. 

This argument is illustrated in Figure 11.1. In Figure 11.1A we have 
plotted the rate of growth of the forest and the rate of growth of a dollar 
invested in a bank. If we want to have the largest amount of money at 
some unspecified point in the future, we should always invest our money 
in the asset with the highest return available at each point in time. When 
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Harvesting a forest. The optimal time to cut a forest is when 
the rate of growth of the forest equals the interest rate. 


the forest is young, it is the asset with the highest return. As it ma¬ 
tures, its rate of growth declines, and eventually the bank offers a higher 
return. 

The effect on total wealth is illustrated in Figure 11.IB. Before T wealth 
grows most rapidly when invested in the forest. After T it grows most 
rapidly when invested in the bank. Therefore, the optimal strategy is to 
invest in the forest up until time T, then harvest the forest, and invest the 
proceeds in the bank. 


EXAMPLE: Gasoline Prices during the Gulf War 

In the Summer of 1990 Iraq invaded Kuwait. As a response to this, the 
United Nations imposed a blockade on oil imports from Iraq. Immediately 
after the blockade was announced the price of oil jumped up on world mar¬ 
kets. At the same time price of gasoline at U.S. pumps increased signifi¬ 
cantly. This in turn led to cries of “war profiteering” and several segments 
about the oil industry on the evening news broadcasts. 

Those who felt the price increase was unjustified argued that it would 
take at least 6 weeks for the new, higher-priced oil to wend its way across 
to the Atlantic and to be refined into gasoline. The oil companies, they 
argued, were making “excessive” profits by raising the price of gasoline that 
had already been produced using cheap oil. 

Let’s think about this argument as economists. Suppose that you own an 
asset—say gasoline in a storage tank—that is currently worth $1 a gallon. 
Six weeks from now, you know that it will be worth $1.50 a gallon. What 
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price will you sell it for now? Certainly you would be foolish to sell it 
for much less than $1.50 a gallon—at any price much lower than that you 
would be better off letting the gasoline sit in the storage tank for 6 weeks. 
The same intertemporal arbitrage reasoning about extracting oil from the 
ground applies to gasoline in a storage tank. The (appropriate discounted) 
price of gasoline tomorrow has to equal the price of gasoline today if you 
want firms to supply gasoline today. 

This makes perfect sense from a welfare point of view as well: if gasoline 
is going to be more expensive in the near future, doesn’t it make sense 
to consume less of it today? The increased price of gasoline encourages 
immediate conservation measures and reflects the true scarcity price of 
gasoline. 

Ironically, the same phenomenon occured two years later in Russia. Dur¬ 
ing the transition to a market economy, Russian oil sold for about $3 a 
barrel at a time when the world price was about $19 a barrel. The oil pro¬ 
ducers anticipated that the price of oil would soon be allowed to rise—so 
they tried to hold back as much oil as possible from current production. As 
one Russian producer put it, “Have you seen anyone in New York selling 
one dollar for 10 cents?” The result was long lines in front of the gasoline 
pumps for Russian consumers. 1 


11.8 Financial Institutions 

Asset markets allow people to change their pattern of consumption over 
time. Consider, for example, two people A and B who have different en¬ 
dowments of wealth. A might have $100 today and nothing tomorrow, 
while B might have $100 tomorrow and nothing today. It might well hap¬ 
pen that each would rather have $50 today and $50 tomorrow. But they 
can reach this pattern of consumption simply by trading: A gives B $50 
today, and B gives A $50 tomorrow. 

In this particular case, the interest rate is zero: A lends B $50 and 
only gets $50 in return the next day. If people have convex preferences 
over consumption today and tomorrow, they would like to smooth their 
consumption over time, rather than consume everything in one period, 
even if the interest rate were zero. 

We can repeat the same kind of story for other patterns of asset endow¬ 
ments. One individual might have an endowment that provides a steady 
stream of payments and prefer to have a lump sum, while another might 
have a lump sum and prefer a steady stream. For example, a twenty-year- 
old individual might want to have a lump sum of money now to buy a 
house, while a sixty-year-old might want to have a steady stream of money 

1 See Louis Uchitelle, “Russians Line Up for Gas as Refineries Sit on Cheap Oil,” New 

York Times, July 12, 1992, page 4. 
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to finance his retirement. It is clear that both of these individuals could 
gain by trading their endowments with each other. 

In a modern economy financial institutions exist to facilitate these trades. 
In the case described above, the sixty-year-old can put his lump sum of 
money in the bank, and the bank can then lend it to the twenty-year-old. 
The twenty-year-old then makes mortgage payments to the bank, which 
are, in turn, transferred to the sixty-year-old as interest payments. Of 
course, the bank takes its cut for arranging the trade, but if the banking 
industry is sufficiently competitive, this cut should end up pretty close to 
the actual costs of doing business. 

Banks aren’t the only kind of financial institution that allow one to 
reallocate consumption over time. Another important example is the stock 
market. Suppose that an entrepreneur starts a company that becomes 
successful. In order to start the company, the entrepreneur probably had 
some financial backers who put up money to help him get started—to pay 
the bills until the revenues started rolling in. Once the company has been 
established, the owners of the company have a claim to the profits that 
the company will generate in the future: they have a claim to a stream of 
payments. 

But it may well be that they prefer a lump-sum reward for their efforts 
now. In this case, the owners can decide to sell the firm to other people 
via the stock market. They issue shares in the company that entitle the 
shareholders to a cut of the future profits of the firm in exchange for a 
lump-sum payment now. People who want to purchase part of the stream 
of profits of the firm pay the original owners for these shares. In this way, 
both sides of the market can reallocate their wealth over time. 

There are a variety of other institutions and markets that help facili¬ 
tate intertemporal trade. But what happens when the buyers and sellers 
aren’t evenly matched? What happens if more people want to sell con¬ 
sumption tomorrow than want to buy it? Just as in any market, if the 
supply of something exceeds the demand, the price will fall. In this case, 
the price of consumption tomorrow will fall. We saw earlier that the price 
of consumption tomorrow was given by 

1 


so this means that the interest rate must rise. The increase in the interest 
rate induces people to save more and to demand less consumption now, 
and thus tends to equate demand and supply. 


Summary 

1. In equilibrium, all assets with certain payoffs must earn the same rate 
of return. Otherwise there would be a riskless arbitrage opportunity. 



APPENDIX 215 


2. The fact that all assets must earn the same return implies that all assets 
will sell for their present value. 

3. If assets are taxed differently, or have different risk characteristics, then 
we must compare their after-tax rates of return or their risk-adjusted rates 
of return. 

REVIEW QUESTIONS 


1. Suppose asset A can be sold for $11 next period. If assets similar to A 
are paying a rate of return of 10%, what must be asset A’s current price? 

2. A house, which you could rent for $10,000 a year and sell for $110,000 a 
year from now, can be purchased for $100,000. What is the rate of return 
on this house? 

3. The payments of certain types of bonds (e.g., municipal bonds) are not 
taxable. If similar taxable bonds are paying 10% and everyone faces a 
marginal tax rate of 40%, what rate of return must the nontaxable bonds 
pay? 

4. Suppose that a scarce resource, facing a constant demand, will be ex¬ 
hausted in 10 years. If an alternative resource will be available at a price 
of $40 and if the interest rate is 10%, what must the price of the scarce 
resource be today? 


APPENDIX 

Suppose that you invest $1 in an asset yielding an interest rate r where the 
interest is paid once a year. Then after T years you will have (1 + r) T dollars. 
Suppose now that the interest is paid monthly. This means that the monthly 
interest rate will be r/12, and there will be 12 T payments, so that after T years 
you will have (l + r/12) 12T dollars. If the interest rate is paid daily, you will have 
(1 + r/365) 365T and so on. 

In general, if the interest is paid n times a year, you will have (1 + r/n) nT 
dollars after T years. It is natural to ask how much money you will have if the 
interest is paid continuously. That is, we ask what is the limit of this expression 
as n goes to infinity. It turns out that this is given by the following expression: 

e rT = lim (1 + r/n) nT , 

n —»oo 

where e is 2.7183 ..., the base of natural logarithms. 

This expression for continuous compounding is very convenient for calculations. 
For example, let us verify the claim in the text that the optimal time to harvest 
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the forest is when the rate of growth of the forest equals the interest rate. Since 
the forest will be worth F(T) at time T, the present value of the forest harvested 
at time T is 

V(T) = = e~ rT F(T). 

e rl 

In order to maximize the present value, we differentiate this with respect to T 
and set the resulting expression equal to zero. This yields 

V'(T) = e~ rT F'(T) - re~ rT F(T) = 0 

or 

F'(T) - rF(T) = 0. 

This can be rearranged to establish the result: 

F'(T) 
r ~ —-—- 
F(T) 

This equation says that the optimal value of T satisfies the condition that the 
rate of interest equals the rate of growth of the value of the forest. 




CHAPTER 



UNCERTAINTY 


Uncertainty is a fact of life. People face risks every time they take a shower, 
walk across the street, or make an investment. But there are financial insti¬ 
tutions such as insurance markets and the stock market that can mitigate 
at least some of these risks. We will study the functioning of these mar¬ 
kets in the next chapter, but first we must study individual behavior with 
respect to choices involving uncertainty. 


12.1 Contingent Consumption 

Since we now know all about the standard theory of consumer choice, let’s 
try to use what we know to understand choice under uncertainty. The first 
question to ask is what is the basic “thing” that is being chosen? 

The consumer is presumably concerned with the probability distri¬ 
bution of getting different consumption bundles of goods. A probability 
distribution consists of a list of different outcomes—in this case, consump¬ 
tion bundles—and the probability associated with each outcome. When a 
consumer decides how much automobile insurance to buy or how much to 
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invest in the stock market, he is in effect deciding on a pattern of probability 
distribution across different amounts of consumption. 

For example, suppose that you have $100 now and that you are con¬ 
templating buying lottery ticket number 13. If number 13 is drawn in the 
lottery, the holder will be paid $200. This ticket costs, say, $5. The two 
outcomes that are of interest are the event that the ticket is drawn and the 
event that it isn’t. 

Your original endowment of wealth—the amount that you would have if 
you did not purchase the lottery ticket—is $100 if 13 is drawn, and $100 
if it isn’t drawn. But if you buy the lottery ticket for $5, you will have 
a wealth distribution consisting of $295 if the ticket is a winner, and $95 
if it is not a winner. The original endowment of probabilities of wealth 
in different circumstances has been changed by the purchase of the lottery 
ticket. Let us examine this point in more detail. 

In this discussion we’ll restrict ourselves to examining monetary gambles 
for convenience of exposition. Of course, it is not money alone that mat¬ 
ters; it is the consumption that money can buy that is the ultimate “good” 
being chosen. The same principles apply to gambles over goods, but re¬ 
stricting ourselves to monetary outcomes makes things simpler. Second, 
we will restrict ourselves to very simple situations where there are only a 
few possible outcomes. Again, this is only for reasons of simplicity. 

Above we described the case of gambling in a lottery; here we’ll consider 
the case of insurance. Suppose that an individual initially has $35,000 
worth of assets, but there is a possibility that he may lose $10,000. For 
example, his car may be stolen, or a storm may damage his house. Suppose 
that the probability of this event happening is p = .01. Then the probability 
distribution the person is facing is a 1 percent probability of having $25,000 
of assets, and a 99 percent probability of having $35,000. 

Insurance offers a way to change this probability distribution. Suppose 
that there is an insurance contract that will pay the person $100 if the loss 
occurs in exchange for a $1 premium. Of course the premium must be paid 
whether or not the loss occurs. If the person decides to purchase $10,000 
dollars of insurance, it will cost him $100. In this case he will have a 1 
percent chance of having $34,900 ($35,000 of other assets — $10,000 loss + 
$ 10,000 payment from the insurance payment - $100 insurance premium) 
and a 99 percent chance of having $34,900 ($35,000 of assets — $100 in¬ 
surance premium). Thus the consumer ends up with the same wealth no 
matter what happens. He is now fully insured against loss. 

In general, if this person purchases K dollars of insurance and has to pay 
a premium 7 K 1 then he will face the gamble : 1 

probability .01 of getting $25,000 + K — 7 K 


1 


The Greek letter 7 , gamma, is pronounced “gam-ma. : 
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and 

probability .99 of getting $35,000 — 7 K. 

What kind of insurance will this person choose? Well, that depends on 
his preferences. He might be very conservative and choose to purchase a lot 
of insurance, or he might like to take risks and not purchase any insurance 
at all. People have different preferences over probability distributions in 
the same way that they have different preferences over the consumption of 
ordinary goods. 

In fact, one very fruitful way to look at decision making under uncertainty 
is just to think of the money available under different circumstances as 
different goods. A thousand dollars after a large loss has occurred may 
mean a very different thing from a thousand dollars when it hasn’t. Of 
course, we don’t have to apply this idea just to money: an ice cream cone 
if it happens to be hot and sunny tomorrow is a very different good from 
an ice cream cone if it is rainy and cold. In general, consumption goods will 
be of different value to a person depending upon the circumstances under 
which they become available. 

Let us think of the different outcomes of some random event as being 
different states of nature. In the insurance example given above there 
were two states of nature: the loss occurs or it doesn’t. But in general 
there could be many different states of nature. We can then think of 
a contingent consumption plan as being a specification of what will 
be consumed in each different state of nature—each different outcome of 
the random process. Contingent means depending on something not yet 
certain, so a contingent consumption plan means a plan that depends on the 
outcome of some event. In the case of insurance purchases, the contingent 
consumption was described by the terms of the insurance contract: how 
much money you would have if a loss occurred and how much you would 
have if it didn’t. In the case of the rainy and sunny days, the contingent 
consumption would just be the plan of what would be consumed given the 
various outcomes of the weather. 

People have preferences over different plans of consumption, just like 
they have preferences over actual consumption. It certainly might make 
you feel better now to know that you are fully insured. People make choices 
that reflect their preferences over consumption in different circumstances, 
and we can use the theory of choice that we have developed to analyze 
those choices. 

If we think about a contingent consumption plan as being just an ordi¬ 
nary consumption bundle, we are right back in the framework described in 
the previous chapters. We can think of preferences as being defined over 
different consumption plans, with the “terms of trade” being given by the 
budget constraint. We can then model the consumer as choosing the best 
consumption plan he or she can afford, just as we have done all along. 
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Let’s describe the insurance purchase in terms of the indifference-curve 
analysis we’ve been using. The two states of nature are the event that the 
loss occurs and the event that it doesn’t. The contingent consumptions are 
the values of how much money you would have in each circumstance. We 
can plot this on a graph as in Figure 12.1. 


Figure 

12.1 



Insurance. The budget line associated with the purchase of 
insurance. The insurance premium 7 allows us to give up some 
consumption in the good outcome ( C g ) in order to have more 
consumption in the bad outcome ( Cb ). 


Your endowment of contingent consumption is $25,000 in the “bad” 
state—if the loss occurs—and $35,000 in the “good” state—if it doesn’t 
occur. Insurance offers you a way to move away from this endowment 
point. If you purchase K dollars’ worth of insurance, you give up 7 K dol¬ 
lars of consumption possibilities in the good state in exchange for K — 7 K 
dollars of consumption possibilities in the bad state. Thus the consumption 
you lose in the good state, divided by the extra consumption you gain in 
the bad state, is 

A C g 7 K 7 

A C b K-'yK 1 - 7 ' 

This is the slope of the budget line through your endowment. It is just 
as if the price of consumption in the good state is 1 — 7 and the price in 
the bad state is 7 . 
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We can draw in the indifference curves that a person might have for con¬ 
tingent consumption. Here again it is very natural for indifference curves 
to have a convex shape: this means that the person would rather have a 
constant amount of consumption in each state than a large amount in one 
state and a low amount in the other. 

Given the indifference curves for consumption in each state of nature, 
we can look at the choice of how much insurance to purchase. As usual, 
this will be characterized by a tangency condition: the marginal rate of 
substitution between consumption in each state of nature should be equal 
to the price at which you can trade off consumption in those states. 

Of course, once we have a model of optimal choice, we can apply all of 
the machinery developed in early chapters to its analysis. We can examine 
how the demand for insurance changes as the price of insurance changes, 
as the wealth of the consumer changes, and so on. The theory of consumer 
behavior is perfectly adequate to model behavior under uncertainty as well 
as certainty. 


EXAMPLE: Catastrophe Bonds 

We have seen that insurance is a way to transfer wealth from good states 
of nature to bad states of nature. Of course there are two sides to these 
transactions: those who buy insurance and those who sell it. Here we focus 
on the sell side of insurance. 

The sell side of the insurance market is divided into a retail component, 
which deals directly with end buyers, and a wholesale component, in which 
insurers sell risks to other parties. The wholesale part of the market is 
known as the reinsurance market. 

Typically, the reinsurance market has relied on large investors such as 
pension funds to provide financial backing for risks. However, some rein¬ 
surers rely on large individual investors. Lloyd’s of London, one of the most 
famous reinsurance consortia, generally uses private investors. 

Recently, the reinsurance industry has been experimenting with catas¬ 
trophe bonds, which, according to some, are a more flexible way to pro¬ 
vide reinsurance. These bonds, generally sold to large institutions, have 
typically been tied to natural disasters, like earthquakes or hurricanes. 

A financial intermediary, such as a reinsurance company or an invest¬ 
ment bank, issues a bond tied to a particular insurable event, such as an 
earthquake involving, say, at least $500 million in insurance claims. If 
there is no earthquake, investors are paid a generous interest rate. But if 
the earthquake occurs and the claims exceed the amount specified in the 
bond, investors sacrifice their principal and interest. 

Catastrophe bonds have some attractive features. They can spread risks 
widely and can be subdivided indefinitely, allowing each investor to bear 
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only a small part of the risk. The money backing up the insurance is paid 
in advance, so there is no default risk to the insured. 

From the economist’s point of view, “cat bonds” are a form of state 
contingent security, that is, a security that pays off if and only if some 
particular event occurs. This concept was first introduced by Nobel laure¬ 
ate Kenneth J. Arrow in a paper published in 1952 and was long thought 
to be of only theoretical interest. But it turned out that all sorts of options 
and other derivatives could be best understood using contingent securi¬ 
ties. Now Wall Street rocket scientists draw on this 50-year-old work when 
creating exotic new derivatives such as catastrophe bonds. 


12.2 Utility Functions and Probabilities 


If the consumer has reasonable preferences about consumption in different 
circumstances, then we will be able to use a utility function to describe these 
preferences, just as we have done in other contexts. However, the fact that 
we are considering choice under uncertainty does add a special structure 
to the choice problem. In general, how a person values consumption in one 
state as compared to another will depend on the probability that the state 
in question will actually occur. In other words, the rate at which I am 
willing to substitute consumption if it rains for consumption if it doesn’t 
should have something to do with how likely I think it is to rain. The 
preferences for consumption in different states of nature will depend on the 
beliefs of the individual about how likely those states are. 

For this reason, we will write the utility function as depending on the 
probabilities as well as on the consumption levels. Suppose that we are 
considering two mutually exclusive states such as rain and shine, loss or 
no loss, or whatever. Let ci and c2 represent consumption in states 1 and 
2 , and let 7Ti and 7r2 be the probabilities that state 1 or state 2 actually 
occurs. 

If the two states are mutually exclusive, so that only one of them can 
happen, then 7T2 = 1 — 7Ti. But we’ll generally write out both probabilities 
just to keep things looking symmetric. 

Given this notation, we can write the utility function for consumption in 
states 1 and 2 as it(ci, C2,7Ti, 7 t 2 ). This is the function that represents the 
individual’s preference over consumption in each state. 


EXAMPLE: Some Examples of Utility Functions 

We can use nearly any of the examples of utility functions that we’ve seen 
up until now in the context of choice under uncertainty. One nice exam¬ 
ple is the case of perfect substitutes. Here it is natural to weight each 
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consumption by the probability that it will occur. This gives us a utility 
function of the form 


u(ci,C2,7ri,7r 2 ) = 7riCi + 7T 2 C2. 


In the context of uncertainty, this kind of expression is known as the ex¬ 
pected value. It is just the average level of consumption that you would 
get. 

Another example of a utility function that might be used to examine 
choice under uncertainty is the Cobb-Douglas utility function: 

u(ci,c 2 , 7T, 1 — 7r) = . 

Here the utility attached to any combination of consumption bundles de¬ 
pends on the pattern of consumption in a nonlinear way. 

As usual, we can take a monotonic transformation of utility and still 
represent the same preferences. It turns out that the logarithm of the 
Cobb-Douglas utility will be very convenient in what follows. This will 
give us a utility function of the form 


In u(ci, c 2 ,7Ti, 7 t 2 ) = 7Ti In ci + 7t 2 In c 2 . 


12.3 Expected Utility 

One particularly convenient form that the utility function might take is the 
following: 

u(ci,C 2 ,7Tl,7r 2 ) = 7 Tiu(ci) + 7T 2 u(c 2 ). 

This says that utility can be written as a weighted sum of some function 
of consumption in each state, v(c\) and u(c 2 ), where the weights are given 
by the probabilities 7Ti and 7r 2 . 

Two examples of this were given above. The perfect substitutes, or 
expected value utility function, had this form where v(c) = c. The Cobb- 
Douglas didn’t have this form originally, but when we expressed it in terms 
of logs, it had the linear form with v(c) = Inc. 

If one of the states is certain, so that 7Ti = 1, say, then v(c\) is the utility 
of certain consumption in state 1. Similarly, if 7r 2 = 1, w(c 2 ) is the utility 
of consumption in state 2. Thus the expression 

TTl^(ci) + 7 T 2 v(c 2 ) 

represents the average utility, or the expected utility, of the pattern of 
consumption (ci,c 2 ). 
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For this reason, we refer to a utility function with the particular form 
described here as an expected utility function, or, sometimes, a von 
Neumann-Morgenstern utility function . 2 

When we say that a consumer’s preferences can be represented by an 
expected utility function, or that the consumer’s preferences have the ex¬ 
pected utility property, we mean that we can choose a utility function that 
has the additive form described above. Of course we could also choose a dif¬ 
ferent form; any monotonic transformation of an expected utility function 
is a utility function that describes the same preferences. But the additive 
form representation turns out to be especially convenient. If the consumer’s 
preferences are described by 7Ti In c\ + n 2 In C2 they will also be described 
by 1 cj 2 ■ But the latter representation does not have the expected utility 
property, while the former does. 

On the other hand, the expected utility function can be subjected to 
some kinds of monotonic transformation and still have the expected utility 
property. We say that a function v(u) is a positive affine transfor¬ 
mation if it can be written in the form: v(u) = au + b where a > 0. A 
positive affine transformation simply means multiplying by a positive num¬ 
ber and adding a constant. It turns out that if you subject an expected 
utility function to a positive affine transformation, it not only represents 
the same preferences (this is obvious since an affine transformation is just a 
special kind of monotonic transformation) but it also still has the expected 
utility property. 

Economists say that an expected utility function is “unique up to an 
affine transformation.” This just means that you can apply an affine trans¬ 
formation to it and get another expected utility function that represents 
the same preferences. But any other kind of transformation will destroy 
the expected utility property. 


12.4 Why Expected Utility Is Reasonable 

The expected utility representation is a convenient one, but is it a rea¬ 
sonable one? Why would we think that preferences over uncertain choices 
would have the particular structure implied by the expected utility func¬ 
tion? As it turns out there are compelling reasons why expected utility is 
a reasonable objective for choice problems in the face of uncertainty. 

The fact that outcomes of the random choice are consumption goods 
that will be consumed in different circumstances means that ultimately 
only one of those outcomes is actually going to occur. Either your house 


John von Neumann was one of the major figures in mathematics in the twentieth 
century. He also contributed several important insights to physics, computer science, 
and economic theory. Oscar Morgenstern was an economist at Princeton who, along 
with von Neumann, helped to develop mathematical game theory. 
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will burn down or it won’t; either it will be a rainy day or a sunny day. The 
way we have set up the choice problem means that only one of the many 
possible outcomes is going to occur, and hence only one of the contingent 
consumption plans will actually be realized. 

This turns out to have a very interesting implication. Suppose you are 
considering purchasing fire insurance on your house for the coming year. In 
making this choice you will be concerned about wealth in three situations: 
your wealth now (co), your wealth if your house burns down (ci), and your 
wealth if it doesn’t (02). (Of course, what you really care about are your 
consumption possibilities in each outcome, but we are simply using wealth 
as a proxy for consumption here.) If tt\ is the probability that your house 
burns down and 7T2 is the probability that it doesn’t, then your preferences 
over these three different consumptions can generally be represented by a 
utility function u( 7 r 1 , 7 T 2 , Co, ci, C2). 

Suppose that we are considering the tradeoff between wealth now and 
one of the possible outcomes—say, how much money we would be willing 
to sacrifice now to get a little more money if the house burns down. Then 
this decision should be independent of how much consumption you will have 
in the other state of nature—how much wealth you will have if the house 
is not destroyed. For the house will either burn down or it won’t. If it 
happens to burn down, then the value of extra wealth shouldn’t depend 
on how much wealth you would have if it didn’t burn down. Bygones are 
bygones—so what doesn’t happen shouldn’t affect the value of consumption 
in the outcome that does happen. 

Note that this is an assumption about an individual’s preferences. It may 
be violated. When people are considering a choice between two things, the 
amount of a third thing they have typically matters. The choice between 
coffee and tea may well depend on how much cream you have. But this 
is because you consume coffee together with cream. If you considered a 
choice where you rolled a die and got either coffee, or tea, or cream, then 
the amount of cream that you might get shouldn’t affect your preferences 
between coffee and tea. Why? Because you are either getting one thing or 
the other: if you end up with cream, the fact that you might have gotten 
either coffee or tea is irrelevant. 

Thus in choice under uncertainty there is a natural kind of “indepen¬ 
dence” between the different outcomes because they must be consumed 
separately—in different states of nature. The choices that people plan to 
make in one state of nature should be independent from the choices that 
they plan to make in other states of nature. This assumption is known as 
the independence assumption. It turns out that this implies that the 
utility function for contingent consumption will take a very special struc¬ 
ture: it has to be additive across the different contingent consumption 
bundles. 

That is, if ci, C2, and C3 are the consumptions in different states of nature, 
and 7Ti, 7T2, and 7r3 are the probabilities that these three different states of 


226 UNCERTAINTY (Ch. 12) 


nature materialize, then if the independence assumption alluded to above 
is satisfied, the utility function must take the form 


C/(ci,c 2 ,c 3 ) = 7 Tim(ci) + 7r 2 u(c 2 ) + n 3 u(c 3 ). 


This is what we have called an expected utility function. Note that the 


expected utility function does indeed satisfy the property that the marginal 


rate of substitution between two goods is independent of how much there 
is of the third good. The marginal rate of substitution between goods 1 
and 2, say, takes the form 



7t 1 Au(ci)/Aci 

7t 2 Am(c 2 )/Ac 2 


This MRS depends only on how much you have of goods 1 and 2, not 
how much you have of good 3. 


12.5 Risk Aversion 

We claimed above that the expected utility function had some very con¬ 
venient properties for analyzing choice under uncertainty. In this section 
we’ll give an example of this. 

Let’s apply the expected utility framework to a simple choice problem. 
Suppose that a consumer currently has $10 of wealth and is contemplating 
a gamble that gives him a 50 percent probability of winning $5 and a 
50 percent probability of losing $5. His wealth will therefore be random: 
he has a 50 percent probability of ending up with $5 and a 50 percent 
probability of ending up with $15. The expected value of his wealth is $10, 
and the expected utility is 


^u($15) + ^u($5). 


This is depicted in Figure 12.2. The expected utility of wealth is the 
average of the two numbers u($15) and u($5), labeled .5u(5) + .5it(15) in 
the graph. We have also depicted the utility of the expected value of wealth, 
which is labeled u($10). Note that in this diagram the expected utility of 
wealth is less than the utility of the expected wealth. That is, 
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Risk aversion. For a risk-averse consumer the utility of the 
expected value of wealth, rt(10), is greater than the expected 
utility of wealth, .5u(5) + .5u(15). 


In this case we say that the consumer is risk averse since he prefers 
to have the expected value of his wealth rather than face the gamble. Of 
course, it could happen that the preferences of the consumer were such 
that he prefers a a random distribution of wealth to its expected value, in 
which case we say that the consumer is a risk lover. An example is given 
in Figure 12.3. 

Note the difference between Figures 12.2 and 12.3. The risk-averse con¬ 
sumer has a concave utility function—its slope gets flatter as wealth is in¬ 
creased. The risk-loving consumer has a convex utility function—its slope 
gets steeper as wealth increases. Thus the curvature of the utility function 
measures the consumer’s attitude toward risk. In general, the more con¬ 
cave the utility function, the more risk averse the consumer will be, and the 
more convex the utility function, the more risk loving the consumer will be. 

The intermediate case is that of a linear utility function. Here the con¬ 
sumer is risk neutral: the expected utility of wealth is the utility of its 
expected value. In this case the consumer doesn’t care about the riskiness 
of his wealth at all—only about its expected value. 


EXAMPLE: The Demand for Insurance 

Let’s apply the expected utility structure to the demand for insurance that 
we considered earlier. Recall that in that example the person had a wealth 


Figure 

12.2 
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Figure 

12.3 



Risk loving. For a risk-loving consumer the expected utility 
of wealth, ,5u(5) + .5u(15), is greater than the utility of the 
expected value of wealth, u( 10 ). 


of $35,000 and that he might incur a loss of $10,000. The probability of the 
loss was 1 percent, and it cost him to purchase K dollars of insurance. 
By examining this choice problem using indifference curves we saw that 
the optimal choice of insurance was determined by the condition that the 
MRS between consumption in the two outcomes—loss or no loss—must be 
equal to — 7/(1 — 7). Let 7 r be the probability that the loss will occur, and 
1 — 7 T be the probability that it won’t occur. 

Let state 1 be the situation involving no loss, so that the person’s wealth 
in that state is 

ci = $35, 000 - 7 K, 

and let state 2 be the loss situation with wealth 


c 2 = $35,000 - $10,000 + K - 7 K. 


Then the consumer’s optimal choice of insurance is determined by the 
condition that his MRS between consumption in the two outcomes be equal 
to the price ratio: 


7tAm(c 2 )/Ac2 _ 7 

(1 — 7t)Am(ci)/Aci 1 —7 


( 12 . 1 ) 


Now let us look at the insurance contract from the viewpoint of the 
insurance company. With probability 7 r they must pay out K , and with 
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probability (1 — 7 r) they pay out nothing. No matter what happens, they 
collect the premium <yK. Then the expected profit, P, of the insurance 
company is 

P = 7 A' — 7 tK — (1 — 7 r) • 0 = 7 AT — 7 tI\. 

Let us suppose that on the average the insurance company just breaks 
even on the contract. That is, they offer insurance at a “fair” rate, where 
“fair” means that the expected value of the insurance is just equal to its 
cost. Then we have 

P = 'yK — irK = 0, 


which implies that 7 = 7r. 

Inserting this into equation (12.1) we have 

7rA-u(c2)/Ac2 7T 

(1 — 7t)Am(ci)/Aci 1 — 7T 

Canceling the 7 r’s leaves us with the condition that the optimal amount of 
insurance must satisfy 

A ^(ci) = Au(c 2 ) , 12 2 > 

Aci Ac2 

This equation says that the marginal utility of an extra dollar of income if 
the loss occurs should be equal to the marginal utility of an extra dollar of 
income if the loss doesn’t occur. 

Let us suppose that the consumer is risk averse, so that his marginal 
utility of money is declining as the amount of money he has increases. 
Then if ci > C 2 , the marginal utility at ci would be less than the marginal 
utility at C2, and vice versa. Furthermore, if the marginal utilities of income 
are equal at ci and C2, as they are in equation ( 12 . 2 ), then we must have 
Ci = C 2 . Applying the formulas for Ci and C 2 , we find 

35,000 -jK = 25,000 + K - 7 K, 

which implies that K = $10,000. This means that when given a chance 
to buy insurance at a “fair” premium, a risk-averse consumer will always 
choose to fully insure. 

This happens because the utility of wealth in each state depends only on 
the total amount of wealth the consumer has in that state—and not what 
he might have in some other state—so that if the total amounts of wealth 
the consumer has in each state are equal, the marginal utilities of wealth 
must be equal as well. 

To sum up: if the consumer is a risk-averse, expected utility maximizer 
and if he is offered fair insurance against a loss, then he will optimally 
choose to fully insure. 
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12.6 Diversification 

Let us turn now to a different topic involving uncertainty—the benefits 
of diversification. Suppose that you are considering investing $100 in two 
different companies, one that makes sunglasses and one that makes rain¬ 
coats. The long-range weather forecasters have told you that next summer 
is equally likely to be rainy or sunny. How should you invest your money? 

Wouldn’t it make sense to hedge your bets and put some money in each? 
By diversifying your holdings of the two investments, you can get a return 
on your investment that is more certain, and therefore more desirable if 
you are a risk-averse person. 

Suppose, for example, that shares of the raincoat company and the sun¬ 
glasses company currently sell for $10 apiece. If it is a rainy summer, the 
raincoat company will be worth $20 and the sunglasses company will be 
worth $5. If it is a sunny summer, the payoffs are reversed: the sunglasses 
company will be worth $20 and the raincoat company will be worth $5. If 
you invest your entire $100 in the sunglasses company, you are taking a 
gamble that has a 50 percent chance of giving you $200 and a 50 percent 
chance of giving you $50. The same magnitude of payoffs results if you 
invest all your money in the sunglasses company: in either case you have 
an expected payoff of $125. 

But look what happens if you put half of your money in each. Then, 
if it is sunny you get $100 from the sunglasses investment and $25 from 
the raincoat investment. But if it is rainy, you get $100 from the raincoat 
investment and $25 from the sunglasses investment. Either way, you end up 
with $125 for sure. By diversifying your investment in the two companies, 
you have managed to reduce the overall risk of your investment, while 
keeping the expected payoff the same. 

Diversification was quite easy in this example: the two assets were per¬ 
fectly negatively correlated—when one went up, the other went down. Pairs 
of assets like this can be extremely valuable because they can reduce risk 
so dramatically. But, alas, they are also very hard to find. Most asset 
values move together: when GM stock is high, so is Ford stock, and so 
is Goodrich stock. But as long as asset price movements are not perfectly 
positively correlated, there will be some gains from diversification. 


12.7 Risk Spreading 

Let us return now to the example of insurance. There we considered the 
situation of an individual who had $35,000 and faced a .01 probability of 
a $10,000 loss. Suppose that there were 1000 such individuals. Then, on 
average, there would be 10 losses incurred, and thus $100,000 lost each year. 
Each of the 1000 people would face an expected loss of .01 times $10,000, or 
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$100 a year. Let us suppose that the probability that any person incurs a 
loss doesn’t affect the probability that any of the others incur losses. That 
is, let us suppose that the risks are independent. 

Then each individual will have an expected wealth of .99 x $35, 000 + 
.01 x $25,000 = $34,900. But each individual also bears a large amount of 
risk: each person has a 1 percent probability of losing $10,000. 

Suppose that each consumer decides to diversify the risk that he or she 
faces. How can they do this? Answer: by selling some of their risk to 
other individuals. Suppose that the 1000 consumers decide to insure one 
another. If anybody incurs the $10,000 loss, each of the 1000 consumers 
will contribute $10 to that person. This way, the poor person whose house 
burns down is compensated for his loss, and the other consumers have the 
peace of mind that they will be compensated if that poor soul happens 
to be themselves! This is an example of risk spreading: each consumer 
spreads his risk over all of the other consumers and thereby reduces the 
amount of risk he bears. 

Now on the average, 10 houses will burn down a year, so on the average, 
each of the 1000 individuals will be paying out $100 a year. But this is just 
on the average. Some years there might be 12 losses, and other years there 
might be 8 losses. The probability is very small that an individual would 
actually have to pay out more than $200, say, in any one year, but even so, 
the risk is there. 

But there is even a way to diversify this risk. Suppose that the home- 
owners agree to pay $100 a year for certain, whether or not there are any 
losses. Then they can build up a cash reserve fund that can be used in 
those years when there are multiple fires. They are paying $100 a year 
for certain, and on average that money will be sufficient to compensate 
homeowners for fires. 

As you can see, we now have something very much like a cooperative 
insurance company. We could add a few more features: the insurance 
company gets to invest its cash reserve fund and earn interest on its assets, 
and so on, but the essence of the insurance company is clearly present. 


12.8 Role of the Stock Market 

The stock market plays a role similar to that of the insurance market in 
that it allows for risk spreading. Recall from Chapter 11 that we argued 
that the stock market allowed the original owners of firms to convert their 
stream of returns over time to a lump sum. Well, the stock market also 
allows them to convert their risky position of having all their wealth tied 
up in one enterprise to a situation where they have a lump sum that they 
can invest in a variety of assets. The original owners of the firm have an 
incentive to issue shares in their company so that they can spread the risk 
of that single company over a large number of shareholders. 
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Similarly, the later shareholders of a company can use the stock market 
to reallocate their risks. If a company you hold shares in is adopting a 
policy that is too risky for your taste—or too conservative—you can sell 
those shares and purchase others. 

In the case of insurance, an individual was able to reduce his risk to 
zero by purchasing insurance. For a flat fee of $100, the individual could 
purchase full insurance against the $10,000 loss. This was true because 
there was basically no risk in the aggregate: if the probability of the loss 
occurring was 1 percent, then on average 10 of the 1000 people would face 
a loss—we just didn’t know which ones. 

In the case of the stock market, there is risk in the aggregate. One year 
the stock market as a whole might do well, and another year it might do 
poorly. Somebody has to bear that kind of risk. The stock market offers a 
way to transfer risky investments from people who don’t want to bear risk 
to people who are willing to bear risk. 

Of course, few people outside of Las Vegas like to bear risk: most people 
are risk averse. Thus the stock market allows people to transfer risk from 
people who don’t want to bear it to people who are willing to bear it if 
they are sufficiently compensated for it. We’ll explore this idea further in 
the next chapter. 


Summary 

1. Consumption in different states of nature can be viewed as consumption 
goods, and all the analysis of previous chapters can be applied to choice 
under uncertainty. 

2. However, the utility function that summarizes choice behavior under 
uncertainty may have a special structure. In particular, if the utility func¬ 
tion is linear in the probabilities, then the utility assigned to a gamble will 
just be the expected utility of the various outcomes. 

3. The curvature of the expected utility function describes the consumer’s 
attitudes toward risk. If it is concave, the consumer is a risk averter; and 
if it is convex, the consumer is a risk lover. 

4. Financial institutions such as insurance markets and the stock market 
provide ways for consumers to diversify and spread risks. 

REVIEW QUESTIONS 


1. How can one reach the consumption points to the left of the endowment 
in Figure 12.1? 
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2. Which of the following utility functions have the expected utility prop¬ 
erty? (a) it(ci,C2,7r 1 ,7r 2 ) = a^Ci + 7r 2 c 2 ), (b) u(c lt c 2 , tt 1; tt 2 ) = t^ci + 
7T 2 c|, (c) m(Ci, C 2 , 7Tl, 7T 2 ) = 7Tl In Cl + 7r 2 lnc 2 + 17. 

3. A risk-averse individual is offered a choice between a gamble that pays 
$1000 with a probability of 25% and $100 with a probability of 75%, or a 
payment of $325. Which would he choose? 

4. What if the payment was $320? 

5. Draw a utility function that exhibits risk-loving behavior for small gam¬ 
bles and risk-averse behavior for larger gambles. 

6. Why might a neighborhood group have a harder time self insuring for 
flood damage versus fire damage? 


APPENDIX 

Let us examine a simple problem to demonstrate the principles of expected utility 
maximization. Suppose that the consumer has some wealth w and is considering 
investing some amount i in a risky asset. This asset could earn a return of r g in 
the “good” outcome, or it could earn a return of rb in the “bad” outcome. You 
should think of r g as being a positive return—the asset increases in value, and 
rb being a negative return—a decrease in asset value. 

Thus the consumer’s wealth in the good and bad outcomes will be 

W g = (w — x) + x(l + r g ) = w + xr g 
Wb = (w — x) + x(l + rb) = w + xrb- 

Suppose that the good outcome occurs with probability n and the bad outcome 
with probability (1 — n). Then the expected utility if the consumer decides to 
invest x dollars is 

EU(x) = 7 ru(w + xr g ) + (1 — n)u(w + xn). 

The consumer wants to choose x so as to maximize this expression. 

Differentiating with respect to x, we find the way in which utility changes as 
x changes: 

EU'(x) = iru (w + xr g )r g + (1 — n)u'(w + xrb)rb- (12-3) 

The second derivative of utility with respect to x is 

EU"(x) = nu'(w + xr g )r 2 g + (1 — n)u"(w + xrb)rl- (12-4) 

If the consumer is risk averse his utility function will be concave, which implies 
that u"(w) < 0 for every level of wealth. Thus the second derivative of expected 
utility is unambiguously negative. Expected utility will be a concave function 
of x. 
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Consider the change in expected utility for the first dollar invested in the risky 
asset. This is just equation (12.3) with the derivative evaluated at x = 0: 

EU'( 0) = 7 Tu'(w)r g + (1 — Tv)u'(w)rb 
= u(w)[n r g + (1 — 7r)rb]. 

The expression inside the brackets is the expected return on the asset. If 
the expected return on the asset is negative, then expected utility must decrease 
when the first dollar is invested in the asset. But since the second derivative 
of expected utility is negative due to concavity, then utility must continue to 
decrease as additional dollars are invested. 

Hence we have found that if the expected value of a gamble is negative, a risk 
averter will have the highest expected utility at x* = 0: he will want no part of a 
losing proposition. 

On the other hand, if the expected return on the asset is positive, then in¬ 
creasing x from zero will increase expected utility. Thus he will always want to 
invest a little bit in the risky asset, no matter how risk averse he is. 

Expected utility as a function of x is illustrated in Figure 12.4. In Figure 12.4A 
the expected return is negative, and the optimal choice is x* = 0. In Figure 12.4B 
the expected return is positive over some range, so the consumer wants to invest 
some positive amount x* in the risky asset. 
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How much to invest in the risky asset. In panel A, the optimal 
investment is zero, but in panel B the consumer wants to invest a 
positive amount. 


The optimal amount for the consumer to invest will be determined by the 
condition that the derivative of expected utility with respect to x be equal to zero. 
Since the second derivative of utility is automatically negative due to concavity, 
this will be a global maximum. 

Setting (12.3) equal to zero we have 

EU'(x) = tvu (w + xr g )r g + (1 — tc)u (w + xrb)rb = 0. (12-5) 

This equation determines the optimal choice of x for the consumer in question. 
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EXAMPLE: The Effect of Taxation on Investment in Risky Assets 

How does the level of investment in a risky asset behave when you tax its return? 
If the individual pays taxes at rate t, then the after-tax returns will be (1 — t)r g 
and (1 — t)rb . Thus the first-order condition determining his optimal investment, 
x, will be 

EU'(x) = ttu'(w + x(l — t)r g ){ 1 — t)r g + (1 — n )u'(w + x{l — t)n,)(l — t)rb = 0. 
Canceling the (1 — t) terms, we have 

EU'(x) = 7 tu (w + x(l — t)r g )r g + (1 — n)u'(w + x{\ — t)rb)rb = 0. (12.6) 

Let us denote the solution to the maximization problem without taxes—when 
t = 0—by x* and denote the solution to the maximization problem with taxes 
by x. What is the relationship between x* and xl 

Your first impulse is probably to think that x * > x —that taxation of a risky 
asset will tend to discourage investment in it. But that turns out to be exactly 
wrong! Taxing a risky asset in the way we described will actually encourage 
investment in it! 

In fact, there is an exact relation between x* and x. It must be the case that 

x 


The proof is simply to note that this value of x satisfies the first-order condition 
for the optimal choice in the presence of the tax. Substituting this choice into 
equation (12.6) we have 

EU'(x) = W{w + 1 - t)r g )r g 

♦ 

+ (1 - n)u'(w + - t)r b )r b 

= tvu {w + x*r g )r g + (1 — 7 r)u (w + x*rb)rb = 0, 

where the last equality follows from the fact that x* is the optimal solution when 
there is no tax. 

What is going on here? How can imposing a tax increase the amount of 
investment in the risky asset? Here is what is happening. When the tax is 
imposed, the individual will have less of a gain in the good state, but he will 
also have less of a loss in the bad state. By scaling his original investment up 
by 1/(1 — t) the consumer can reproduce the same after-tax returns that he had 
before the tax was put in place. The tax reduces his expected return, but it also 
reduces his risk: by increasing his investment the consumer can get exactly the 
same pattern of returns he had before and thus completely offset the effect of the 
tax. A tax on a risky investment represents a tax on the gain when the return is 
positive—but it represents a subsidy on the loss when the return is negative. 
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In the last chapter we examined a model of individual behavior under 
uncertainty and the role of two economic institutions for dealing with un¬ 
certainty: insurance markets and stock markets. In this chapter we will 
further explore how stock markets serve to allocate risk. In order to do 
this, it is convenient to consider a simplified model of behavior under un¬ 
certainty. 


13.1 Mean-Variance Utility 

In the last chapter we examined the expected utility model of choice under 
uncertainty. Another approach to choice under uncertainty is to describe 
the probability distributions that are the objects of choice by a few param¬ 
eters and think of the utility function as being defined over those param¬ 
eters. The most popular example of this approach is the mean-variance 
model. Instead of thinking that a consumer’s preferences depend on the 
entire probability distribution of his wealth over every possible outcome, 
we suppose that his preferences can be well described by considering just 
a few summary statistics about the probability distribution of his wealth. 
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Let us suppose that a random variable w takes on the values uu s for 
s = 1,..., S with probability 7 t s . The mean of a probability distribution 
is simply its average value: 


S 



This is the formula for an average: take each outcome w s , weight it by the 
probability that it occurs, and sum it up over all outcomes. 1 


The variance of a probability distribution is the average value of (w — 

Mu.) 2 : 


S 



The variance measures the “spread” of the distribution and is a reasonable 
measure of the riskiness involved. A closely related measure is the stan¬ 
dard deviation, denoted by a w , which is the square root of the variance: 



The mean of a probability distribution measures its average value—what 
the distribution is centered around. The variance of the distribution mea¬ 
sures the “spread” of the distribution—how spread out it is around the 
mean. See Figure 13.1 for a graphical depiction of probability distributions 
with different means and variances. 

The mean-variance model assumes that the utility of a probability dis¬ 
tribution that gives the investor wealth w s with a probability of n s can 
be expressed as a function of the mean and variance of that distribution, 
u(fi w , cr^,). Or, if it is more convenient, the utility can be expressed as a 
function of the mean and standard deviation, u{yL w ,a w ). Since both vari¬ 
ance and standard deviation are measures of the riskiness of the wealth 
distribution, we can think of utility as depending on either one. 

This model can be thought of as a simplification of the expected utility 
model described in the preceding chapter. If the choices that are being 
made can be completely characterized in terms of their mean and vari¬ 
ance, then a utility function for mean and variance will be able to rank 
choices in the same way that an expected utility function will rank them. 
Furthermore, even if the probability distributions cannot be completely 
characterized by their means and variances, the mean-variance model may 
well serve as a reasonable approximation to the expected utility model. 

We will make the natural assumption that a higher expected return is 
good, other things being equal, and that a higher variance is bad. This 
is simply another way to state the assumption that people are typically 
averse to risk. 


1 


The Greek letter /r, mu, is pronounced “mew.” The Greek letter cr, sigma, is pro¬ 
nounced “sig-ma.” 
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Figure 

13.1 


Mean and variance. The probability distribution depicted in 
panel A has a positive mean, while that depicted in panel B has 


a negative mean. The distribution in panel A is more “spread 
out” than the one in panel B, which means that it has a larger 
variance. 


Let us use the mean-variance model to analyze a simple portfolio prob¬ 
lem. Suppose that you can invest in two different assets. One of them, 
the risk-free asset, always pays a fixed rate of return, ry. This would be 
something like a Treasury bill that pays a fixed rate of interest regardless 
of what happens. 

The other asset is a risky asset. Think of this asset as being an invest¬ 
ment in a large mutual fund that buys stocks. If the stock market does 
well, then your investment will do well. If the stock market does poorly, 
your investment will do poorly. Let m s be the return on this asset if state 
s occurs, and let 7r s be the probability that state s will occur. We’ll use 
r m to denote the expected return of the risky asset and a m to denote the 
standard deviation of its return. 

Of course you don’t have to choose one or the other of these assets; 
typically you’ll be able to divide your wealth between the two. If you hold 
a fraction of your wealth x in the risky asset, and a fraction (1 — x) in the 
risk-free asset, the expected return on your portfolio will be given by 


s 



s 


s 



Since ^ 7r s = 1, we have 


XT m + (1 - x)r f . 
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Risk and return. The budget line measures the cost of achiev¬ 
ing a larger expected return in terms of the increased standard 
deviation of the return. At the optimal choice the indifference 
curve must be tangent to this budget line. 


Figure 

13.2 


Thus the expected return on the portfolio is a weighted average of the two 
expected returns. 

The variance of your portfolio return will be given by 

S 

a 2 x = y^X xm s + (1 - x ) r f - I’xf'Ks- 

s=1 

Substituting for r x , this becomes 

s 

= 'Y^( XTn s ~ xr m) 2 K s 

s =1 
5 

= '^2,x 2 (m s - r m ) 2 ?r s 

S= 1 
2 2 

= X (T m . 

Thus the standard deviation of the portfolio return is given by 

a x = \Jx 2 cr^ = XOm- 

It is natural to assume that r m > r/, since a risk-averse investor would 
never hold the risky asset if it had a lower expected return than the risk¬ 
free asset. It follows that if you choose to devote a higher fraction of your 
wealth to the risky asset, you will get a higher expected return, but you 
will also incur higher risk. This is depicted in Figure 13.2. 
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If you set x = 1 you will put all of your money in the risky asset and you 
will have an expected return and standard deviation of If you 

set x = 0 you will put all of your wealth in the sure asset and you have an 
expected return and standard deviation of (r/,0). If you set x somewhere 
between 0 and 1, you will end up somewhere in the middle of the line 
connecting these two points. This line gives us a budget line describing the 
market tradeoff between risk and return. 

Since we are assuming that people’s preferences depend only on the mean 
and variance of their wealth, we can draw indifference curves that illustrate 
an individual’s preferences for risk and return. If people are risk averse, 
then a higher expected return makes them better off and a higher standard 
deviation makes them worse off. This means that standard deviation is a 
“bad.” It follows that the indifference curves will have a positive slope, as 
shown in Figure 13.2. 

At the optimal choice of risk and return the slope of the indifference 
curve has to equal the slope of the budget line in Figure 13.2. We might 
call this slope the price of risk since it measures how risk and return can 
be traded off in making portfolio choices. From inspection of Figure 13.2 
the price of risk is given by 


r m~r f 
& m 


(13.1) 


So our optimal portfolio choice between the sure and the risky asset could 
be characterized by saying that the marginal rate of substitution between 
risk and return must be equal to the price of risk: 


MRS = - 


AU/Acr 

AU/Ay 


r m - r f 

® m 


(13.2) 


Now suppose that there are many individuals who are choosing between 
these two assets. Each one of them has to have his marginal rate of substi¬ 
tution equal to the price of risk. Thus in equilibrium all of the individuals’ 
MRSs will be equal: when people are given sufficient opportunities to trade 
risks, the equilibrium price of risk will be equal across individuals. Risk is 
like any other good in this respect. 

We can use the ideas that we have developed in earlier chapters to ex¬ 
amine how choices change as the parameters of the problem change. All 
of the framework of normal goods, inferior goods, revealed preference, and 
so on can be brought to bear on this model. For example, suppose that an 
individual is offered a choice of a new risky asset y that has a mean return 
of r y , say, and a standard deviation of a yi as illustrated in Figure 13.3. 

If offered the choice between investing in x and investing in y , which will 
the consumer choose? The original budget set and the new budget set are 
both depicted in Figure 13.3. Note that every choice of risk and return 
that was possible in the original budget set is possible with the new budget 
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Preferences between risk and return. The asset with risk- 
return combination y is preferred to the one with combination x. 


set since the new budget set contains the old one. Thus investing in the 
asset y and the risk-free asset is definitely better than investing in x and 
the risk-free asset, since the consumer can choose a better final portfolio. 

The fact that the consumer can choose how much of the risky asset he 
wants to hold is very important for this argument. If this were an “all 
or nothing” choice where the consumer was compelled to invest all of his 
money in either x or y , we would get a very different outcome. In the 
example depicted in Figure 13.3, the consumer would prefer investing all 
of his money in x to investing all of his money in y , since x lies on a 
higher indifference curve than y. But if he can mix the risky asset with the 
risk-free asset, he would always prefer to mix with y rather than to mix 
with x. 


13.2 Measuring Risk 


We have a model above that describes the price of risk ... but how do we 
measure the amount of risk in an asset? The first thing that you would 
probably think of is the standard deviation of an asset’s return. After all, 
we are assuming that utility depends on the mean and variance of wealth, 
aren’t we? 

In the above example, where there is only one risky asset, that is exactly 
right: the amount of risk in the risky asset is its standard deviation. But if 


Figure 

13.3 
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there are many risky assets, the standard deviation is not an appropriate 
measure for the amount of risk in an asset. 

This is because a consumer’s utility depends on the mean and variance of 
total wealth—not the mean and variance of any single asset that he might 
hold. What matters is how the returns of the various assets a consumer 
holds interact to create a mean and variance of his wealth. As in the rest 
of economics, it is the marginal impact of a given asset on total utility 
that determines its value, not the value of that asset held alone. Just as 
the value of an extra cup of coffee may depend on how much cream is 
available, the amount that someone would be willing to pay for an extra 
share of a risky asset will depend on how it interacts with other assets in 
his portfolio. 

Suppose, for example, that you are considering purchasing two assets, 
and you know that there are only two possible outcomes that can happen. 
Asset A will be worth either $10 or —$5, and asset B will be worth either 
—$5 or $10. But when asset A is worth $10, asset B will be worth —$5 and 
vice versa. In other words the values of the two assets will be negatively 
correlated: when one has a large value, the other will have a small value. 

Suppose that the two outcomes are equally likely, so that the average 
value of each asset will be $2.50. Then if you don’t care about risk at all 
and you must hold one asset or the other, the most that you would be 
willing to pay for either one would be $2.50—the expected value of each 
asset. If you are averse to risk, you would be willing to pay even less than 
$2.50. 

But what if you can hold both assets? Then if you hold one share of 
each asset, you will get $5 whichever outcome arises. Whenever one asset 
is worth $10, the other is worth —$5. Thus, if you can hold both assets, 
the amount that you would be willing to pay to purchase both assets would 
be $5. 

This example shows in a vivid way that the value of an asset will depend 
in general on how it is correlated with other assets. Assets that move in 
opposite directions—that are negatively correlated with each other -are 
very valuable because they reduce overall risk. In general the value of an 
asset tends to depend much more on the correlation of its return with other 
assets than with its own variation. Thus the amount of risk in an asset 
depends on its correlation with other assets. 

It is convenient to measure the risk in an asset relative to the risk in the 
stock market as a whole. We call the riskiness of a stock relative to the 
risk of the market the beta of a stock, and denote it by the Greek letter 
p. Thus, if i represents some particular stock, we write Pi for its riskiness 
relative to the market as a whole. Roughly speaking: 

. how risky asset i is 

Pi = -:--- 

how risky the stock market is 

If a stock has a beta of 1, then it is just as risky as the market as a whole; 
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when the market moves up by 10 percent, this stock will, on the average, 
move up by 10 percent. If a stock has a beta of less than 1, then when 
the market moves up by 10 percent, the stock will move up by less than 
10 percent. The beta of a stock can be estimated by statistical methods 
to determine how sensitive the movements of one variable are relative to 
another, and there are many investment advisory services that can provide 
you with estimates of the beta of a stock. 2 


13.3 Counterparty Risk 

Financial institutions loan money not just to individuals but to each other. 
There is always the chance that one party to a loan may fail to repay the 
loan, a risk known as counterparty risk. 

To see how this works, imagine 3 banks, A, B, and C. Bank A owes B a 
billion dollars, Bank B owes C a billion dollars, and Bank C owes bank A a 
billion dollars. Now suppose that Bank A runs out of money and defaults 
on its loan. Bank B is now out a billion dollars and may not be able to 
pay C. Bank C, in turn, can’t pay A, pushing A even further in the hole. 
This sort of effect is known as financial contagion or systemic risk. It 
is a very simplified version of what happened to U.S. financial institutions 
in the Fall of 2008. 

What’s the solution? One way to deal with this sort of problem is to 
have a “lender of last resort,” which is typically a central bank, such as 
the U.S. Federal Reserve System. Bank A can go to the Federal Reserve 
and request an emergency loan of a billion dollars. It now pays off its loan 
from Bank B, which in turn pays Bank C, which in turn pays back Bank 
A. Bank A now has sufficient assets to pay back the loan from the central 
bank. 

This is, of course, an overly simplified example. Initially, there was no net 
debt among the three banks. If they had gotten together to compare assets 
and liabilities, they would have certainly discovered that fact. However, 
when assets and liabilities span thousands of financial institutions, it may 
be difficult to determine net positions, which is why lenders of last resort 
may be necessary. 


13.4 Equilibrium in a Market for Risky Assets 

We are now in a position to state the equilibrium condition for a market 
with risky assets. Recall that in a market with only certain returns, we 


The Greek letter ft beta, is pronounced “bait-uh.” For those of you who know some 
statistics, the beta of a stock is defined to be ft = cov(rj, r m )/var(f m ). That is, ft 
is the covariance of the return on the stock with the market return divided by the 
variance of the market return. 


2 
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saw that all assets had to earn the same rate of return. Here we have a 
similar principle: all assets, after adjusting for risk, have to earn the same 
rate of return. 

The catch is about adjusting for risk. How do we do that? The answer 
comes from the analysis of optimal choice given earlier. Recall that we 
considered the choice of an optimal portfolio that contained a riskless asset 
and a risky asset. The risky asset was interpreted as being a mutual fund- 
a diversified portfolio including many risky assets. In this section we’ll 
suppose that this portfolio consists of all risky assets. 

Then we can identify the expected return on this market portfolio of 
risky assets with the market expected return, r m , and identify the standard 
deviation of the market return with the market risk, a m . The return on 
the safe asset is r/, the risk-free return. 

We saw in equation (13.1) that the price of risk, p , is given by 

r m ~ r f 
P= --■ 

& m 

We said above that the amount of risk in a given asset i relative to the 
total risk in the market is denoted by Pi. This means that to measure the 
total amount of risk in asset i, we have to multiply by the market risk, a m . 
Thus the total risk in asset i is given by Pi<T m . 

What is the cost of this risk? Just multiply the total amount of risk, 
PiU m , by the price of risk. This gives us the risk adjustment: 

risk adjustment = PiCr m p 

a r m - r f 

= Pi^m -- 

O m 

= Pi(r m - r f ). 

Now we can state the equilibrium condition in markets for risky assets: 
in equilibrium all assets should have the same risk-adjusted rate of return. 
The logic is just like the logic used in Chapter 12: if one asset had a 
higher risk-adjusted rate of return than another, everyone would want to 
hold the asset with the higher risk-adjusted rate. Thus in equilibrium the 
risk-adjusted rates of return must be equalized. 

If there are two assets i and j that have expected returns r,; and rj 
and betas of Pi and Pj, we must have the following equation satisfied in 
equilibrium: 

n ~ Pi{r m ~ r f ) = rj - Pj{r m - r f ). 

This equation says that in equilibrium the risk-adjusted returns on the two 
assets must be the same—where the risk adjustment comes from multiply¬ 
ing the total risk of the asset by the price of risk. 

Another way to express this condition is to note the following. The risk¬ 
free asset, by definition, must have pf = 0. This is because it has zero risk, 
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Figure 

13.4 


EXPECTED 

RETURN 


Market line 



(slope = r m - r f ) 

r m 



r f 





BETA 

The market line. The market line depicts the combinations 
of expected return and beta for assets held in equilibrium. 


and (3 measures the amount of risk in an asset. Thus for any asset i we 
must have 

ri - Pi(r m - r f ) = r f - (3 f {r m - r f ) = r f . 

Rearranging, this equation says 

n = r f + Pi{rm - r f ) 

or that the expected return on any asset must be the risk-free return plus 
the risk adjustment. This latter term reflects the extra return that people 
demand in order to bear the risk that the asset embodies. This equation is 
the main result of the Capital Asset Pricing Model (CAPM), which 
has many uses in the study of financial markets. 


13.5 How Returns Adjust 

In studying asset markets under certainty, we showed how prices of assets 
adjust to equalize returns. Let’s look at the same adjustment process here. 

According to the model sketched out above, the expected return on any 
asset should be the risk-free return plus the risk premium: 

n = r f + Pi(r m - r f ). 

In Figure 13.4 we have illustrated this line in a graph with the different 
values of beta plotted along the horizontal axis and different expected re¬ 
turns on the vertical axis. According to our model, all assets that are held 
in equilibrium have to lie along this line. This line is called the market 
line. 
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What if some asset’s expected return and beta didn’t lie on the market 
line? What would happen? 

The expected return on the asset is the expected change in its price 
divided by its current price: 

r'i = expected value of ———. 

Po 

This is just like the definition we had before, with the addition of the word 
“expected.” We have to include “expected” now since the price of the asset 
tomorrow is uncertain. 

Suppose that you found an asset whose expected return, adjusted for 
risk, was higher than the risk-free rate: 

U - (3i{r m - r f ) > r f . 

Then this asset is a very good deal. It is giving a higher risk-adjusted 
return than the risk-free rate. 

When people discover that this asset exists, they will want to buy it. 
They might want to keep it for themselves, or they might want to buy it 
and sell it to others, but since it is offering a better tradeoff between risk 
and return than existing assets, there is certainly a market for it. 

But as people attempt to buy this asset they will bid up today’s price: 
Po will rise. This means that the expected return r* = (pi — po)/po will 
fall. How far will it fall? Just enough to lower the expected rate of return 
back down to the market line. 

Thus it is a good deal to buy an asset that lies above the market line. 
For when people discover that it has a higher return given its risk than 
assets they currently hold, they will bid up the price of that asset. 

This is all dependent on the hypothesis that people agree about the 
amount of risk in various assets. If they disagree about the expected returns 
or the betas of different assets, the model becomes much more complicated. 


EXAMPLE: Value at Risk 

It is sometimes of interest to determine the risk of a certain set of assets. 
For example, suppose that a bank holds a particular portfolio of stocks. It 
may want to estimate the probability that the portfolio will fall by more 
than a million dollars on a given day. If this probability is 5% then we 
say that the portfolio has a “one-day 5% value at risk of $1 million.” 
Typically value at risk is computed for 1 day or 2 week periods, using loss 
probabilities of 1% or 5%. 

The theoretical idea of VaR is attractive. All the challenges lie in figuring 
out ways to estimate it. But, as financial analyst Philippe Jorion has put 
it, “[T]he greatest benefit of VaR lies in the imposition of a structured 
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methodology for critically thinking about risk. Institutions that go through 
the process of computing their VaR are forced to confront their exposure 
to financial risks and to set up a proper risk management function. Thus 
the process of getting to VaR may be as important as the number itself.” 

The VaR is determined entirely by the probability distribution of the 
value of the portfolio, and this depends on the correlation of the assets in 
the portfolio. Typically, assets are positively correlated, so they all move 
up or down at once. Even worse, the distribution of asset prices tends to 
have “fat tails” so that there may be a relatively high probability of an 
extreme price movement. Ideally, one would estimate VaR using a long 
history of price movements. In practice, this is difficult to do, particularly 
for new and exotic assets. 

In the Fall of 2008 many financial institutions discovered that their VaR 
estimates were severely flawed since asset prices dropped much more than 
was anticipated. In part this was due to the fact that statistical estimates 
were based on very small samples that were gathered during a stable period 
of economic activity. The estimated values at risk understated the true risk 
of the assets in question. 


EXAMPLE: Ranking Mutual Funds 

The Capital Asset Pricing Model can be used to compare different invest¬ 
ments with respect to their risk and their return. One popular kind of 
investment is a mutual fund. These are large organizations that accept 
money from individual investors and use this money to buy and sell stocks 
of companies. The profits made by such investments are then paid out to 
the individual investors. 

The advantage of a mutual fund is that you have professionals managing 
your money. The disadvantage is they charge you for managing it. These 
fees are usually not terribly large, however, and most small investors are 
probably well advised to use a mutual fund. 

But how do you choose a mutual fund in which to invest? You want one 
with a high expected return of course, but you also probably want one with 
a minimum amount of risk. The question is, how much risk are you willing 
to tolerate to get that high expected return? 

One thing that you might do is to look at the historical performance 
of various mutual funds and calculate the average yearly return and the 
beta—the amount of risk—of each mutual fund you are considering. Since 
we haven’t discussed the precise definition of beta, you might find it hard 
to calculate. But there are books where you can look up the historical 
betas of mutual funds. 

If you plotted the expected returns versus the betas, you would get a 
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Figure 

13.5 


diagram similar to that depicted in Figure 13.5. * * 3 Note that the mutual 
funds with high expected returns will generally have high risk. The high 
expected returns are there to compensate people for bearing risk. 

One interesting thing you can do with the mutual fund diagram is to 
compare investing with professional managers to a very simple strategy 
like investing part of your money in an index fund. There are several 
indices of stock market activity like the Dow-Jones Industrial Average, or 
the Standard and Poor’s Index, and so on. The indices are typically the 
average returns on a given day of a certain group of stocks. The Standard 
and Poor’s Index, for example, is based on the average performance of 500 
large stocks in the United States. 



1 BETA 


Mutual funds. Comparing the returns on mutual fund in¬ 
vestment to the market line. 


An index fund is a mutual fund that holds the stocks that make up such 

an index. This means that you are guaranteed to get the average perfor¬ 
mance of the stocks in the index, virtually by definition. Since holding the 
average is not a very difficult thing to do—at least compared to trying to 
beat the average—index funds typically have low management fees. Since 
an index fund holds a very broad base of risky assets, it will have a beta 


3 See Michael Jensen, “The Performance of Mutual Funds in the Period 1945-1964,” 
Journal of Finance , 23 (May 1968), 389-416, for a more detailed discussion of how 
to examine mutual fund performance using the tools we have sketched out in this 
chapter. Mark Grinblatt and Sheridan Titman have examined more recent data 
in “Mutual Fund Performance: An Analysis of Quarterly Portfolio Holdings,” The 
Journal of Business, 62 (July 1989), 393-416. 
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that is very close to 1—it will be just as risky as the market as a whole, 
because the index fund holds nearly all the stocks in the market as a whole. 

How does an index fund do as compared to the typical mutual fund? 
Remember the comparison has to be made with respect to both risk and 
return of the investment. One way to do this is to plot the expected return 
and beta of a Standard and Poor’s Index fund, and draw the line connecting 
it to the risk-free rate, as in Figure 13.5. You can get any combination of 
risk and return on this line that you want just by deciding how much money 
you want to invest in the risk-free asset and how much you want to invest 
in the index fund. 

Now let’s count the number of mutual funds that plot below this line. 
These are mutual funds that offer risk and return combinations that are 
dominated by those available by the index fund/risk-free asset combina¬ 
tions. When this is done, it turns out that the vast majority of the risk- 
return combinations offered by mutual funds are below the line. The num¬ 
ber of funds that plot above the line is no more than could be expected by 
chance alone. 

But seen another way, this finding might not be too surprising. The stock 
market is an incredibly competitive environment. People are always trying 
to find undervalued stocks in order to purchase them. This means that on 
average, stocks are usually trading for what they’re really worth. If that is 
the case, then betting the averages is a pretty reasonable strategy—since 
beating the averages is almost impossible. 


Summary 

1. We can use the budget set and indifference curve apparatus developed 
earlier to examine the choice of how much money to invest in risky and 
riskless assets. 

2. The marginal rate of substitution between risk and return will have to 
equal the slope of the budget line. This slope is known as the price of risk. 

3. The amount of risk present in an asset depends to a large extent on its 
correlation with other assets. An asset that moves opposite the direction 
of other assets helps to reduce the overall risk of your portfolio. 

4. The amount of risk in an asset relative to that of the market as a whole 
is called the beta of the asset. 

5. The fundamental equilibrium condition in asset markets is that risk- 
adjusted returns have to be the same. 

6. Counterparty risk, which is the risk that the other side of a transaction 
will not pay, can also be an important risk factor. 
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REVIEW QUESTIONS 


1. If the risk-free rate of return is 6%, and if a risky asset is available with 
a return of 9% and a standard deviation of 3%, what is the maximum rate 
of return you can achieve if you are willing to accept a standard deviation 
of 2%? What percentage of your wealth would have to be invested in the 
risky asset? 

2. What is the price of risk in the above exercise? 

3. If a stock has a f3 of 1.5, the return on the market is 10%, and the risk¬ 
free rate of return is 5%, what expected rate of return should this stock 
offer according to the Capital Asset Pricing Model? If the expected value 
of the stock is $100, what price should the stock be selling for today? 


CHAPTER 



CONSUMER'S 

SURPLUS 


In the preceding chapters we have seen how to derive a consumer’s demand 
function from the underlying preferences or utility function. But in prac¬ 
tice we are usually concerned with the reverse problem—how to estimate 
preferences or utility from observed demand behavior. 

We have already examined this problem in two other contexts. In Chap¬ 
ter 5 we showed how one could estimate the parameters of a utility function 
from observing demand behavior. In the Cobb-Douglas example used in 
that chapter, we were able to estimate a utility function that described 
the observed choice behavior simply by calculating the average expendi¬ 
ture share of each good. The resulting utility function could then be used 
to evaluate changes in consumption. 

In Chapter 7 we described how to use revealed preference analysis to 
recover estimates of the underlying preferences that may have generated 
some observed choices. These estimated indifference curves can also be 
used to evaluate changes in consumption. 

In this chapter we will consider some more approaches to the problem 
of estimating utility from observing demand behavior. Although some of 
the methods we will examine are less general than the two methods we 
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examined previously, they will turn out to be useful in several applications 
that we will discuss later in the book. 

We will start by reviewing a special case of demand behavior for which 
it is very easy to recover an estimate of utility. Later we will consider more 
general cases of preferences and demand behavior. 


14.1 Demand for a Discrete Good 

Let us start by reviewing demand for a discrete good with quasilinear 
utility, as described in Chapter 6. Suppose that the utility function takes 
the form v(x ) + y and that the x-good is only available in integer amounts. 
Let us think of the y-good as money to be spent on other goods and set its 
price to 1. Let p be the price of the x-good. 

We saw in Chapter 6 that in this case consumer behavior can be described 
in terms of the reservation prices, rq = u(l) — u(0), r 2 = v(2) — u(l), and 
so on. The relationship between reservation prices and demand was very 
simple: if n units of the discrete good are demanded, then r n > p> r n+ 

To verify this, let’s look at an example. Suppose that the consumer 
chooses to consume 6 units of the x-good when its price is p. Then the 
utility of consuming (6, m — 6 p) must be at least as large as the utility of 
consuming any other bundle (cc, m — px): 

u(6) + m — 6p > v(x) + to — px. (14.1) 

In particular this inequality must hold for x = 5, which gives us 

v(6) + to — 6p > u(5) + to — 5 p. 

Rearranging, we have u(6) — u(5) = re > p. 

Equation (14.1) must also hold for x = 7. This gives us 

u(6) + to — 6p > v(7) + to — 7p, 
which can be rearranged to yield 

p > v(7) — v(6) = Tt- 

This argument shows that if 6 units of the x-good is demanded, then the 
price of the x-good must lie between rg and 7 - 7 . In general, if n units of 
the x-good are demanded at price p , then r n > p > r n + 1 , as we wanted to 
show. The list of reservation prices contains all the information necessary to 
describe the demand behavior. The graph of the reservation prices forms a 
“staircase” as shown in Figure 14.1. This staircase is precisely the demand 
curve for the discrete good. 
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14.2 Constructing Utility from Demand 

We have just seen how to construct the demand curve given the reservation 
prices or the utility function. But we can also do the same operation in 
reverse. If we are given the demand curve, we can construct the utility 
function—at least in the special case of quasilinear utility. 

At one level, this is just a trivial operation of arithmetic. The reservation 
prices are defined to be the difference in utility: 

ri = u(l) — u(0) 
r 2 = v(2) - u(l) 
r 3 = u(3) - v(2) 


If we want to calculate u(3), for example, we simply add up both sides of 
this list of equations to find 


»T T r 2 T r 3 = u(3) - u(0). 

It is convenient to set the utility from consuming zero units of the good 
equal to zero, so that u(0) = 0, and therefore v(n) is just the sum of the 
first n reservation prices. 

This construction has a nice geometrical interpretation that is illustrated 
in Figure 14.1 A. The utility from consuming n units of the discrete good is 
just the area of the first n bars which make up the demand function. This 
is true because the height of each bar is the reservation price associated 
with that level of demand and the width of each bar is 1. This area is 
sometimes called the gross benefit or the gross consumer’s surplus 
associated with the consumption of the good. 

Note that this is only the utility associated with the consumption of 
good 1. The final utility of consumption depends on the how much the 
consumer consumes of good 1 and good 2. If the consumer chooses n units 
of the discrete good, then he will have to — pn dollars left over to purchase 
other things. This leaves him with a total utility of 

v(n) + m — pn. 

This utility also has an interpretation as an area: we just take the area 
depicted in Figure 14.1A, subtract off the expenditure on the discrete good, 
and add to. 

The term v{n) — pn is called consumer’s surplus or the net con¬ 
sumer’s surplus. It measures the net benefits from consuming n units of 
the discrete good: the utility v{n) minus the reduction in the expenditure 
on consumption of the other good. The consumer’s surplus is depicted in 
Figure 14.IB. 


254 CONSUMER'S SURPLUS (Ch. 14) 


Figure 

14.1 
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A Gross surplus B Net surplus 


Reservation prices and consumer’s surplus. The gross 
benefit in panel A is the area under the demand curve. This 
measures the utility from consuming the x-good. The con¬ 
sumer’s surplus is depicted in panel B. It measures the utility 
from consuming both goods when the first good has to be pur¬ 
chased at a constant price p. 


14.3 Other Interpretations of Consumer's Surplus 

There are some other ways to think about consumer’s surplus. Suppose 
that the price of the discrete good is p. Then the value that the consumer 
places on the first unit of consumption of that good is r i, but he only has 
to pay p for it. This gives him a “surplus” of ri — p on the first unit of 
consumption. He values the second unit of consumption at r 2 , but again 
he only has to pay p for it. This gives him a surplus of 7~2 — p on that unit. 
If we add this up over all n units the consumer chooses, we get his total 
consumer’s surplus: 

CS = r\ — p + r 2 — p + ■ ■ ■ + r n — p = n + ■ ■ ■ + r n — np. 

Since the sum of the reservation prices just gives us the utility of consump¬ 
tion of good 1, we can also write this as 

CS = v(n) — pn. 

We can interpret consumer’s surplus in yet another way. Suppose that a 
consumer is consuming n units of the discrete good and paying pn dollars 
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to do so. How much money would he need to induce him to give up his 
entire consumption of this good? Let R be the required amount of money. 
Then R must satisfy the equation 

u(0) + to + R',= v(n) + m — pn. 

Since u(0) = 0 by definition, this equation reduces to 

R = v{n ) — pn, 

which is just consumer’s surplus. Hence the consumer’s surplus measures 
how much a consumer would need to be paid to give up his entire con¬ 
sumption of some good. 


14.4 From Consumer's Surplus to Consumers' Surplus 

Up until now we have been considering the case of a single consumer. If sev¬ 
eral consumers are involved we can add up each consumer’s surplus across 
all the consumers to create an aggregate measure of the consumers’ sur¬ 
plus. Note carefully the distinction between the two concepts: consumer’s 
surplus refers to the surplus of a single consumer; consumers’ surplus refers 
to the sum of the surpluses across a number of consumers. 

Consumers’ surplus serves as a convenient measure of the aggregate gains 
from trade, just as consumer’s surplus serves as a measure of the individual 
gains from trade. 


14.5 Approximating a Continuous Demand 

We have seen that the area underneath the demand curve for a discrete 
good measures the utility of consumption of that good. We can extend this 
to the case of a good available in continuous quantities by approximating 
the continuous demand curve by a staircase demand curve. The area under 
the continuous demand curve is then approximately equal to the area under 
the staircase demand. 

See Figure 14.2 for an example. In the Appendix to this chapter we show 
how to use calculus to calculate the exact area under a demand curve. 


14.6 Quasilinear Utility 

It is worth thinking about the role that quasilinear utility plays in this 
analysis. In general the price at which a consumer is willing to purchase 
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Figure 

14.2 



PRICE 


P 


X QUANTITY 

B Approximation to net surplus 



Approximating a continuous demand. The consumer’s 
surplus associated with a continuous demand curve can be ap¬ 
proximated by the consumer’s surplus associated with a discrete 
approximation to it. 


some amount of good 1 will depend on how much money he has for con¬ 
suming other goods. This means that in general the reservation prices for 
good 1 will depend on how much good 2 is being consumed. 

But in the special case of quasilinear utility the reservation prices are 
independent of the amount of money the consumer has to spend on other 
goods. Economists say that with quasilinear utility there is “no income 
effect” since changes in income don’t affect demand. This is what allows 
us to calculate utility in such a simple way. Using the area under the 
demand curve to measure utility will only be exactly correct when the 
utility function is quasilinear. 

But it may often be a good approximation. If the demand for a good 
doesn’t change very much when income changes, then the income effects 
won’t matter very much, and the change in consumer’s surplus will be a 
reasonable approximation to the change in the consumer’s utility. 1 


14.7 Interpreting the Change in Consumer's Surplus 

We are usually not terribly interested in the absolute level of consumer’s 
surplus. We are generally more interested in the change in consumer’s 


1 Of course, the change in consumer’s surplus is only one way to represent a change in 
utility—the change in the square root of consumer’s surplus would be just as good. 
But it is standard to use consumer’s surplus as a standard measure of utility. 
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surplus that results from some policy change. For example, suppose the 
price of a good changes from p' to p". How does the consumer’s surplus 
change? 

In Figure 14.3 we have illustrated the change in consumer’s surplus as¬ 
sociated with a change in price. The change in consumer’s surplus is the 
difference between two roughly triangular regions and will therefore have 
a roughly trapezoidal shape. The trapezoid is further composed of two 
subregions, the rectangle indicated by R and the roughly triangular region 
indicated by T. 



Change in consumer’s surplus. The change in consumer’s 
surplus will be the difference between two roughly triangular 
areas, and thus will have a roughly trapezoidal shape. 


Figure 

14.3 


The rectangle measures the loss in surplus due to the fact that the con¬ 
sumer is now paying more for all the units he continues to consume. After 
the price increases the consumer continues to consume x" units of the good, 
and each unit of the good is now more expensive by p" —p'. This means he 
has to spend ( p" — p')x" more money than he did before just to consume 
x" units of the good. 

But this is not the entire welfare loss. Due to the increase in the price 
of the x-good, the consumer has decided to consume less of it than he was 
before. The triangle T measures the value of the lost consumption of the 
x-good. The total loss to the consumer is the sum of these two effects: R 
measures the loss from having to pay more for the units he continues to 
consume, and T measures the loss from the reduced consumption. 
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EXAMPLE: The Change in Consumer's Surplus 

Question: Consider the linear demand curve D(p) = 20 — 2 p. When the 
price changes from 2 to 3 what is the associated change in consumer’s 
surplus? 

Answer: When p = 2, D( 2) = 16, and when p = 3, D( 3) = 14. Thus we 
want to compute the area of a trapezoid with a height of 1 and bases of 14 
and 16. This is equivalent to a rectangle with height 1 and base 14 (having 
an area of 14), plus a triangle of height 1 and base 2 (having an area of 1). 
The total area will therefore be 15. 


14.8 Compensating and Equivalent Variation 

The theory of consumer’s surplus is very tidy in the case of quasilinear 
utility. Even if utility is not quasilinear, consumer’s surplus may still be 
a reasonable measure of consumer’s welfare in many applications. Usually 
the errors in measuring demand curves outweigh the approximation errors 
from using consumer’s surplus. 

But it may be that for some applications an approximation may not 
be good enough. In this section we’ll outline a way to measure “utility 
changes” without using consumer’s surplus. There are really two separate 
issues involved. The first has to do with how to estimate utility when we 
can observe a number of consumer choices. The second has to do with how 
we can measure utility in monetary units. 

We’ve already investigated the estimation problem. We gave an example 
of how to estimate a Cobb-Douglas utility function in Chapter 6. In that 
example we noticed that expenditure shares were relatively constant and 
that we could use the average expenditure share as estimates of the Cobb- 
Douglas parameters. If the demand behavior didn’t exhibit this particular 
feature, we would have to choose a more complicated utility function, but 
the principle would be just the same: if we have enough observations on 
demand behavior and that behavior is consistent with maximizing some¬ 
thing, then we will generally be able to estimate the function that is being 
maximized. 

Once we have an estimate of the utility function that describes some 
observed choice behavior we can use this function to evaluate the impact 
of proposed changes in prices and consumption levels. At the most funda¬ 
mental level of analysis, this is the best we can hope for. All that matters 
are the consumer’s preferences; any utility function that describes the con¬ 
sumer’s preferences is as good as any other. 

However, in some applications it may be convenient to use certain mon¬ 
etary measures of utility. For example, we could ask how much money we 


COMPENSATING AND EQUIVALENT VARIATION 259 


would have to give a consumer to compensate him for a change in his con¬ 
sumption patterns. A measure of this type essentially measures a change 
in utility, but it measures it in monetary units. What are convenient ways 
to do this? 

Suppose that we consider the situation depicted in Figure 14.4. Here 
the consumer initially faces some prices (p\, 1) and consumes some bundle 
(xl , X 2 ) ■ The price of good 1 then increases from p\ to p 1 , and the consumer 
changes his consumption to (x±, £ 2 ). How much does this price change hurt 
the consumer? 



The compensating and the equivalent variations. Panel 
A shows the compensating variation (CV), and panel B shows 
the equivalent variation (EV). 


Figure 

14.4 


One way to answer this question is to ask how much money we would 
have to give the consumer after the price change to make him just as 
well off as he was before the price change. In terms of the diagram, we 
ask how far up we would have to shift the new budget line to make it tan¬ 
gent to the indifference curve that passes through the original consumption 
point (x\,X 2 )- The change in income necessary to restore the consumer to 
his original indifference curve is called the compensating variation in 
income, since it is the change in income that will just compensate the con¬ 
sumer for the price change. The compensating variation measures how 
much extra money the government would have to give the consumer if it 
wanted to exactly compensate the consumer for the price change. 

Another way to measure the impact of a price change in monetary terms 
is to ask how much money would have to be taken away from the consumer 
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before the price change to leave him as well off as he would be after the 
price change. This is called the equivalent variation in income since it 
is the income change that is equivalent to the price change in terms of 
the change in utility. In Figure 14.4 we ask how far down we must shift 
the original budget line to just touch the indifference curve that passes 
through the new consumption bundle. The equivalent variation measures 
the maximum amount of income that the consumer would be willing to pay 
to avoid the price change. 

In general the amount of money that the consumer would be willing 
to pay to avoid a price change would be different from the amount of 
money that the consumer would have to be paid to compensate him for 
a price change. After all, at different sets of prices a dollar is worth a 
different amount to a consumer since it will purchase different amounts of 
consumption. 

In geometric terms, the compensating and equivalent variations are just 
two different ways to measure “how far apart” two indifference curves are. 
In each case we are measuring the distance between two indifference curves 
by seeing how far apart their tangent lines are. In general this measure 
of distance will depend on the slope of the tangent lines—that is, on the 
prices that we choose to determine the budget lines. 

However, the compensating and equivalent variation are the same in one 
important case—the case of quasilinear utility. In this case the indifference 
curves are parallel, so the distance between any two indifference curves is 
the same no matter where it is measured, as depicted in Figure 14.5. In 
the case of quasilinear utility the compensating variation, the equivalent 
variation, and the change in consumer’s surplus all give the same measure 
of the monetary value of a price change. 


EXAMPLE: Compensating and Equivalent Variations 


1 1 

Suppose that a consumer has a utility function u(x 1 , 2 : 2 ) = He 

originally faces prices (1,1) and has income 100. Then the price of good 1 
increases to 2. What are the compensating and equivalent variations? 

We know that the demand functions for this Cobb-Douglas utility func¬ 
tion are given by 

TO 


Xi 


x 2 


2pi 

TO 

2p 2 ' 


Using this formula, we see that the consumer’s demands change from 
= (50,50) to (xi,x 2 ) = (25,50). 

To calculate the compensating variation we ask how much money would 
be necessary at prices (2,1) to make the consumer as well off as he was 
consuming the bundle (50,50)? If the prices were (2,1) and the consumer 
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Quasilinear preferences. With quasilinear preferences, the 
distance between two indifference curves is independent of the 
slope of the budget lines. 


had income to, we can substitute into the demand functions to find that 
the consumer would optimally choose the bundle (to/4,to/ 2). Setting the 
utility of this bundle equal to the utility of the bundle (50, 50) we have 

/TON 2 /TON I l l 

(t) (j) = 50 50 ‘ 

Solving for to gives us 

to = IOOn/2 « 141. 

Hence the consumer would need about 141 —100 = $41 of additional money 
after the price change to make him as well off as he was before the price 
change. 

In order to calculate the equivalent variation we ask how much money 
would be necessary at the prices (1,1) to make the consumer as well off 
as he would be consuming the bundle (25,50). Letting to stand for this 
amount of money and following the same logic as before, 

/TON 2 /ton I „_i cr i 

(y) (y) =25 ' 50 ’- 

Solving for to gives us 

m = 50 n/2 « 70. 

Thus if the consumer had an income of $70 at the original prices, he would 
be just as well off as he would be facing the new prices and having an 
income of $100. The equivalent variation in income is therefore about 
100 - 70 = $30. 


Figure 

14.5 
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EXAMPLE: Compensating and Equivalent Variation for Quasilinear 
Preferences 

Suppose that the consumer has a quasilinear utility function v(x\) + X2- 
We know that in this case the demand for good 1 will depend only on the 
price of good 1, so we write it as X\(pi). Suppose that the price changes 
from p\ to pi . What are the compensating and equivalent variations? 

At the price p\, the consumer chooses x\ = Xi{p\) and has a utility of 
v (. x i) + ixi — Pixl- At the price pi, the consumer choose x\ = x\(pi) and 
has a utility of v(x\) + m — piX\. 

Let C be the compensating variation. This is the amount of extra money 
the consumer would need after the price change to make him as well off as 
he would be before the price change. Setting these utilities equal we have 

v(xi) + m + C — pixi = + to — p\x\. 

Solving for C we have 

C = v(x\) — v(x\) +P\Xi — p\ x\. 

Let E be the equivalent variation. This is the amount of money that 
you could take away from the consumer before the price change that would 
leave him with the same utility that he would have after the price change. 
Thus it satisfies the equation 

v{x[) + m — E — p\x\ = v(xi) + to — p\Xi- 
Solving for E , we have 

E = v(x\) — w(ii) +P1X1 — p\x\. 

Note that for the case of quasilinear utility the compensating and equiv¬ 
alent variation are the same. Furthermore, they are both equal to the 
change in (net) consumer’s surplus: 

ACS = [v(x*) - p\x\} - - pixi]. 


14.9 Producer's Surplus 

The demand curve measures the amount that will be demanded at each 
price; the supply curve measures the amount that will be supplied at 
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each price. Just as the area under the demand curve measures the sur¬ 
plus enjoyed by the demanders of a good, the area above the supply curve 
measures the surplus enjoyed by the suppliers of a good. 

We’ve referred to the area under the demand curve as consumer’s sur¬ 
plus. By analogy, the area above the supply curve is known as producer’s 
surplus. The terms consumer’s surplus and producer’s surplus are some¬ 
what misleading, since who is doing the consuming and who is doing the 
producing really doesn’t matter. It would be better to use the terms “de- 
mander’s surplus” and “supplier’s surplus,” but we’ll bow to tradition and 
use the standard terminology. 

Suppose that we have a supply curve for a good. This simply measures 
the amount of a good that will be supplied at each possible price. The 
good could be supplied by an individual who owns the good in question, or 
it could be supplied by a firm that produces the good. We’ll take the latter 
interpretation so as to stick with the traditional terminology and depict 
the producer’s supply curve in Figure 14.6. If the producer is able to sell 
x* units of her product in a market at a price p *, what is the surplus she 
enjoys? 

It is most convenient to conduct the analysis in terms of the producer’s 
inverse supply curve, p s (x). This function measures what the price would 
have to be to get the producer to supply x units of the good. 
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Producer’s surplus. The net producer’s surplus is the trian¬ 
gular area to the left of the supply curve in panel A, and the 
change in producer’s surplus is the trapezoidal area in panel B. 


Figure 

14.6 


Think about the inverse supply function for a discrete good. In this case 
the producer is willing to sell the first unit of the good at price p s (l), but 
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she actually gets the market price p* for it. Similarly, she is willing to 
sell the second unit for p s ( 2), but she gets p* for it. Continuing in this 
way we see that the producer will be just willing to sell the last unit for 
p s (x*) =p*. 

The difference between the minimum amount she would be willing to sell 
the x* units for and the amount she actually sells the units for is the net 
producer’s surplus. It is the triangular area depicted in Figure 14.6A. 

Just as in the case of consumer’s surplus, we can ask how producer’s 
surplus changes when the price increases from p' to p". In general, the 
change in producer’s surplus will be the difference between two triangular 
regions and will therefore generally have the roughly trapezoidal shape 
depicted in Figure 14.6B. As in the case of consumer’s surplus, the roughly 
trapezoidal region will be composed of a rectangular region R and a roughly 
triangular region T. The rectangle measures the gain from selling the units 
previously sold anyway at p' at the higher price p". The roughly triangular 
region measures the gain from selling the extra units at the price p". This 
is analogous to the change in consumer’s surplus considered earlier. 

Although it is common to refer to this kind of change as an increase 
in producer’s surplus, in a deeper sense it really represents an increase in 
consumer’s surplus that accrues to the consumers who own the firm that 
generated the supply curve. Producer’s surplus is closely related to the 
idea of profit, but we’ll have to wait until we study firm behavior in more 
detail to spell out the relationship. 


14.10 Benefit-Cost Analysis 

We can use the consumer surplus apparatus we have developed to calculate 
the benefits and costs of various economic policies. 

For example, let us examine the impact of a price ceiling. Consider the 
situation depicted in Figure 14.7. With no intervention, the price would 
be po and the quantity sold would be qo- 

The authorities believe this price is too high and impose the price ceiling 
at p c . This reduces the amount that suppliers are willing to supply to q c 
which, in turn, reduces their producer surplus to the shaded area in the 
diagram. 

Now that there is only q c available for consumers, the question is who 
will get it? 

One assumption is that the output will go to the consumers with the 
highest willingness to pay. Let p e , the effective price, be the price that 
would induce consumers to demand q e . If everyone who is willing to pay 
more than p e gets the good, then the producer surplus will be the shaded 
area in the diagram. 

Note that the lost consumer and producer surplus is given by the trape¬ 
zoidal area in the middle of the diagram. This is the difference between 
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A price ceiling. The price ceiling at p c reduces supply to 
q e . It reduces consumer surplus to CS and producer surplus to 

PS. The effective price of the good, p e , is the price that would 
clear the market. The diagram also shows what happens with 
rationing, in which case the price of a ration coupon would be 

Pe - Pc- 


the consumer plus producer surplus in the competitive market and the 
difference in the market with the price ceiling. 

Assuming that the quantity will go to consumers with the highest will¬ 
ingness to pay is overly optimistic in most situation. Hence, we we would 
generally expect that this trapezoidal area is a lower bound on the lost 
consumer plus producer surplus in the case of a price ceiling. 


Rationing 

The diagram we have just examined can also be used to describe the social 
losses due to rationing. Instead of fixing a price ceiling of p c , suppose 
that the authorities issue ration coupons that allow for only q c units to be 
purchased. In order to purchase one unit of the good, a consumer needs to 
pay p c to the seller and produce a ration coupon. 

If the ration coupons are marketable, then they would sell for a price of 
p e — p c . This would make the the total price of the purchase equal to p e , 
which is the price that clears the market for the good being sold. 


Figure 

14.7 
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14.11 Calculating Gains and Losses 

If we have estimates of the market demand and supply curves for a good, 
it is not difficult in principle to calculate the loss in consumers’ surplus due 
to changes in government policies. For example, suppose the government 
decides to change its tax treatment of some good. This will result in a 
change in the prices that consumers face and therefore a change in the 
amount of the good that they will choose to consume. We can calculate the 
consumers’ surplus associated with different tax proposals and see which 
tax reforms generate the smallest loss. 

This is often useful information for judging various methods of taxation, 
but it suffers from two defects. First, as we’ve indicated earlier, the con¬ 
sumer’s surplus calculation is only valid for special forms of preferences— 
namely, preferences representable by a quasilinear utility function. We 
argued earlier that this kind of utility function may be a reasonable ap¬ 
proximation for goods for which changes in income lead to small changes 
in demand, but for goods whose consumption is closely related to income, 
the use of consumer surplus may be inappropriate. 

Second, the calculation of this loss effectively lumps together all the 
consumers and producers and generates an estimate of the “cost” of a 
social policy only for some mythical “representative consumer.” In many 
cases it is desirable to know not only the average cost across the population, 
but who bears the costs. The political success or failure of policies often 
depends more on the distribution of gains and losses than on the average 
gain or loss. 

Consumer’s surplus may be easy to calculate, but we’ve seen that it is 
not that much more difficult to calculate the true compensating or equiv¬ 
alent variation associated with a price change. If we have estimates of the 
demand functions of each household—or at least the demand functions for 
a sample of representative households—we can calculate the impact of a 
policy change on each household in terms of the compensating or equiva¬ 
lent variation. Thus we will have a measure of the “benefits” or “costs” 
imposed on each household by the proposed policy change. 

Mervyn King, an economist at the London School of Economics, has 
described a nice example of this approach to analyzing the implications 
of reforming the tax treatment of housing in Britain in his paper “Wel¬ 
fare Analysis of Tax Reforms Using Household Data,” Journal of Public 
Economics, 21 (1983), 183-214. 

King first examined the housing expenditures of 5,895 households and 
estimated a demand function that best described their purchases of hous¬ 
ing services. Next, he used this demand function to determine a utility 
function for each household. Finally, he used the estimated utility function 
to calculate how much each household would gain or lose under certain 
changes in the taxation of housing in Britain. The measure that he used 
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was similar to the equivalent variation described earlier in this chapter. 
The basic nature of the tax reform he studied was to eliminate tax con¬ 
cessions to owner-occupied housing and to raise rents in public housing. 
The revenues generated by these changes would be handed back to the 
households in the form of transfers proportional to household income. 

King found that 4,888 of the 5,895 households would benefit from this 
kind of reform. More importantly he could identify explicitly those house¬ 
holds that would have significant losses from the tax reform. King found, 
for example, that 94 percent of the highest income households gained from 
the reform, while only 58 percent of the lowest income households gained. 
This kind of information would allow special measures to be undertaken 
which might help in designing the tax reform in a way that could satisfy 
distributional objectives. 


Summary 

1. In the case of a discrete good and quasilinear utility, the utility associ¬ 
ated with the consumption of n units of the discrete good is just the sum 
of the first n reservation prices. 

2. This sum is the gross benefit of consuming the good. If we subtract the 
amount spent on the purchase of the good, we get the consumer’s surplus. 

3. The change in consumer’s surplus associated with a price change has a 
roughly trapezoidal shape. It can be interpreted as the change in utility 
associated with the price change. 

4. In general, we can use the compensating variation and the equivalent 
variation in income to measure the monetary impact of a price change. 

5. If utility is quasilinear, the compensating variation, the equivalent vari¬ 
ation, and the change in consumer’s surplus are all equal. Even if utility 
is not quasilinear, the change in consumer’s surplus may serve as a good 
approximation of the impact of the price change on a consumer’s utility. 

6. In the case of supply behavior we can define a producer’s surplus that 
measures the net benefits to the supplier from producing a given amount 
of output. 


REVIEW QUESTIONS 


1. A good can be produced in a competitive industry at a cost of $10 per 
unit. There are 100 consumers are each willing to pay $12 each to consume 
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a single unit of the good (additional units have no value to them.) What 
is the equilibrium price and quantity sold? The government imposes a tax 
of $1 on the good. What is the deadweight loss of this tax? 

2. Suppose that the demand curve is given by D(p) = 10 — p. What is the 
gross benefit from consuming 6 units of the good? 

3. In the above example, if the price changes from 4 to 6, what is the change 
in consumer’s surplus? 

4. Suppose that a consumer is consuming 10 units of a discrete good and 
the price increases from $5 per unit to $6. However, after the price change 
the consumer continues to consume 10 units of the discrete good. What is 
the loss in the consumer’s surplus from this price change? 


APPENDIX 


Let’s use some calculus to treat consumer’s surplus rigorously. Start with the 
problem of maximizing quasilinear utility: 

max v(x) + y 
x,y 

such that px + y = m. 

Substituting from the budget constraint we have 

maxr(i) + m — px. 

X 

The first-order condition for this problem is 

v'(x) = p. 

This means that the inverse demand function p(x) is defined by 

p(x) = v'(x). (14-2) 


Note the analogy with the discrete-good framework described in the text: the 
price at which the consumer is just willing to consume x units is equal to the 
marginal utility. 

But since the inverse demand curve measures the derivative of utility, we can 
simply integrate under the inverse demand function to find the utility function. 
Carrying out the integration we have: 

/•X 

( t ) dt= p(t ) dt. 

Jo 

Hence utility associated with the consumption of the x-good is just the area under 
the demand curve. 


PX 

v(x) — v(x) — t;(0) = / v 

Jo 
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Table 

14.1 


Comparison of CV, CS, and EV. 


Pi 

CV 

CS 

EV 

1 

0.00 

0.00 

0.00 

2 

7.18 

6.93 

6.70 

3 

11.61 

10.99 

10.40 

4 

14.87 

13.86 

12.94 

5 

17.46 

16.09 

14.87 


EXAMPLE: A Few Demand Functions 


Suppose that the demand function is linear, so that x(p) = a — bp. Then the 
change in consumer’s surplus when the price moves from p to q is given by 


f 

J p 


(a — bt) dt = at — b— = a(q — p) — b- 


Another commonly used demand function, which we examine in more detail 
in the next chapter, has the form x(p) = Ap e , where e < 0 and A is some 
positive constant. When the price changes from p to q, the associated change in 
consumer’s surplus is 

ri ^+1-19 


j 

J p 


t e ~ tl 1 9 n IL+1 — n e+1 
At e dt = A ——- = A- 9 P 


e + 1J, 


e + 1 


for t ^ -1. 

When e = —1, this demand function is x(p) = A/p, which is closely related 
to our old friend the Cobb-Douglas demand, x(p) = am/p. The change in con¬ 
sumer’s surplus for the Cobb-Douglas demand is 


dt = am In t 


= am (In q — hip). 


EXAMPLE: CV, EV, and Consumer's Surplus 

In the text we calculated the compensating and equivalent variations for the 
Cobb-Douglas utility function. In the preceding example we calculated the 
change in consumer’s surplus for the Cobb-Douglas utility function. Here we 
compare these three monetary measures of the impact on utility of a price change. 

Suppose that the price of good 1 changes from 1 to 2,3... while the price of 
good 2 stays fixed at 1 and income stays fixed at 100. Table 14.1 shows the equiv¬ 
alent variation (EV), compensating variation (CV), and the change in consumer’s 

J_ JL 

surplus (CS) for the Cobb-Douglas utility function u(x i, * 2 ) = x/° * 2 ° ■ 

Note that the change in consumer’s surplus always lies between the CV and 
the EV and that the difference between the three numbers is relatively small. It 
is possible to show that both of these facts are true in reasonably general circum¬ 
stances. See Robert Willig, “Consumer’s Surplus without Apology,” American 
Economic Review, 66 (1976), 589-597. 













CHAPTER 



MARKET 

DEMAND 


We have seen in earlier chapters how to model individual consumer choice. 
Here we see how to add up individual choices to get total market demand. 
Once we have derived the market demand curve, we will examine some of 
its properties, such as the relationship between demand and revenue. 


15.1 From Individual to Market Demand 

Let us use x}(pi,p 2 ,rrii) to represent consumer *’s demand function for 
good 1 and x?(pi 1 p 2 1 rrii) for consumer i’s demand function for good 2. 
Suppose that there are n consumers. Then the market demand for good 
1, also called the aggregate demand for good 1, is the sum of these 
individual demands over all consumers: 


n 

X 1 (p 1 ,p 2 ,m 1 ,...,m n ) = ^ ~2 x l{pi,p 2 ,mi ). 

1=1 


The analogous equation holds for good 2. 
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Since each individual’s demand for each good depends on prices and 
his or her money income, the aggregate demand will generally depend on 
prices and the distribution of incomes. However, it is sometimes convenient 
to think of the aggregate demand as the demand of some “representative 
consumer” who has an income that is just the sum of all individual incomes. 
The conditions under which this can be done are rather restrictive, and a 
complete discussion of this issue is beyond the scope of this book. 

If we do make the representative consumer assumption, the aggregate 
demand function will have the form X 1 (pi,p 2 , M), where M is the sum 
of the incomes of the individual consumers. Under this assumption, the 
aggregate demand in the economy is just like the demand of some individual 
who faces prices (pi,P 2 ) and has income M. 

If we fix all the money incomes and the price of good 2, we can illustrate 
the relation between the aggregate demand for good 1 and its price, as in 
Figure 15.1. Note that this curve is drawn holding all other prices and 
incomes fixed. If these other prices and incomes change, the aggregate 
demand curve will shift. 



The market demand curve. The market demand curve is 
the sum of the individual demand curves. 


For example, if goods 1 and 2 are substitutes, then we know that in¬ 
creasing the price of good 2 will tend to increase the demand for good 1 
whatever its price. This means that increasing the price of good 2 will 
tend to shift the aggregate demand curve for good 1 outward. Similarly, 


Figure 

15.1 
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if goods 1 and 2 are complements, increasing the price of good 2 will shift 
the aggregate demand curve for good 1 inward. 

If good 1 is a normal good for an individual, then increasing that individ¬ 
ual’s money income, holding everything else fixed, would tend to increase 
that individual’s demand, and therefore shift the aggregate demand curve 
outward. If we adopt the representative consumer model, and suppose 
that good 1 is a normal good for the representative consumer, then any 
economic change that increases aggregate income will increase the demand 
for good 1. 


15.2 The Inverse Demand Function 

We can look at the aggregate demand curve as giving us quantity as a 
function of price or as giving us price as a function of quantity. When we 
want to emphasize this latter view, we will sometimes refer to the inverse 
demand function, P(X). This function measures what the market price 
for good 1 would have to be for X units of it to be demanded. 

We’ve seen earlier that the price of a good measures the marginal rate 
of substitution (MRS) between it and all other goods; that is, the price 
of a good represents the marginal willingness to pay for an extra unit of 
the good by anyone who is demanding that good. If all consumers are 
facing the same prices for goods, then all consumers will have the same 
marginal rate of substitution at their optimal choices. Thus the inverse 
demand function, P(X), measures the marginal rate of substitution, or the 
marginal willingness to pay, of every consumer who is purchasing the good. 

The geometric interpretation of this summing operation is pretty obvious. 
Note that we are summing the demand or supply curves horizontally: for 
any given price, we add up the individuals’ quantities demanded, which, of 
course, are measured on the horizontal axis. 


EXAMPLE: Adding Up "Linear" Demand Curves 

Suppose that one individual’s demand curve is D\{p) = 20 — p and another 
individual’s is D 2 (p) = 10 — 2 p. What is the market demand function? We 
have to be a little careful here about what we mean by “linear” demand 
functions. Since a negative amount of a good usually has no meaning, we 
really mean that the individual demand functions have the form 

Di(p) = max{20 — p, 0} 

D 2 (p) = max{10 — 2 p, 0}. 

What economists call “linear” demand curves actually aren’t linear func¬ 
tions! The sum of the two demand curves looks like the curve depicted in 
Figure 15.2. Note the kink at p = 5. 
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PRICE 

A , , PRICE 

Agent 1 s 

Agent 2's 

PRICE 

Market demand = 


demand 

demand 


sum of the two 

20 



20 

c demand curves 

15 

10 

D,(P) 


15 

10 

p,(p)+D 2 (p) 

5 

\ 

D 2 (p) 

5 



*1 


X 2 

X-| + 


A 

B 


c 


The sum of two “linear” demand curves. Since the de¬ 
mand curves are only linear for positive quantities, there will 
typically be a kink in the market demand curve. 


15.3 Discrete Goods 

If a good is available only in discrete amounts, then we have seen that the 
demand for that good for a single consumer can be described in terms of 
the consumer’s reservation prices. Here we examine the market demand 
for this kind of good. For simplicity, we will restrict ourselves to the case 
where the good will be available in units of zero or one. 

In this case the demand of a consumer is completely described by his 
reservation price—the price at which he is just willing to purchase one 
unit. In Figure 15.3 we have depicted the demand curves for two con¬ 
sumers, A and B, and the market demand, which is the sum of these two 
demand curves. Note that the market demand curve in this case must 
“slope downward,” since a decrease in the market price must increase the 
number of consumers who are willing to pay at least that price. 


15.4 The Extensive and the Intensive Margin 


In preceding chapters we have concentrated on consumer choice in which 
the consumer was consuming positive amounts of each good. When the 
price changes, the consumer decides to consume more or less of one good 
or the other, but still ends up consuming some of both goods. Economists 
sometimes say that this is an adjustment on the intensive margin. 

In the reservation-price model, the consumers are deciding whether or 
not to enter the market for one of the goods. This is sometimes called an 
adjustment on the extensive margin. The slope of the aggregate demand 
curve will be affected by both sorts of decisions. 


Figure 

15.2 
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Figure 

15.3 


■ 


Agent A's 
demand 

Agent B's 
demand 

Demand 

market 

Pa 

. 


-- Pa 

. 



P B 

. - Pb 

.. 



*A 

*B 

x A + x B 



A 

B 

C 

Market demand for a discrete good. The market demand 

curve 

is the sum of the demand curves of all the consumers in 

the market, here represented by the two consumers A and B. 


We saw earlier that the adjustment on the intensive margin was in the 
“right” direction for normal goods: when the price went up, the quantity 
demanded went down. The adjustment on the extensive margin also works 
in the “right” direction. Thus aggregate demand curves can generally be 
expected to slope downward. 


15.5 Elasticity 

In Chapter 6 we saw how to derive a demand function from a consumer’s 
underlying preferences. It is often of interest to have a measure of how 
“responsive” demand is to some change in price or income. Now the first 
idea that springs to mind is to use the slope of a demand function as a 
measure of responsiveness. After all, the definition of the slope of a demand 
function is the change in quantity demanded divided by the change in price: 

slope of demand function = ——-. 

A p 

and that certainly looks like a measure of responsiveness. 

Well, it is a measure of responsiveness—but it presents some problems. 
The most important one is that the slope of a demand function depends on 
the units in which you measure price and quantity. If you measure demand 
in gallons rather than in quarts, the slope becomes four times smaller. 
Rather than specify units all the time, it is convenient to consider a unit- 
free measure of responsiveness. Economists have chosen to use a measure 
known as elasticity. 

The price elasticity of demand, e, is defined to be the percent change 
in quantity divided by the percent change in price. 1 A 10 percent increase 

1 The Greek letter e, epsilon, is pronounced “eps-i-lon.” 
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in price is the same percentage increase whether the price is measured in 
American dollars or English pounds; thus measuring increases in percentage 
terms keeps the definition of elasticity unit-free. 

In symbols the definition of elasticity is 

g= A q/q 

A p/p' 

Rearranging this definition we have the more common expression: 

pAq 
e = —i— • 
qAp 

Hence elasticity can be expressed as the ratio of price to quantity multiplied 
by the slope of the demand function. In the Appendix to this chapter we 
describe elasticity in terms of the derivative of the demand function. If 
you know calculus, the derivative formulation is the most convenient way 
to think about elasticity. 

The sign of the elasticity of demand is generally negative, since demand 
curves invariably have a negative slope. However, it is tedious to keep 
referring to an elasticity of minus something-or-other, so it is common in 
verbal discussion to refer to elasticities of 2 or 3, rather than —2 or —3. We 
will try to keep the signs straight in the text by referring to the absolute 
value of elasticity, but you should be aware that verbal treatments tend to 
drop the minus sign. 

Another problem with negative numbers arises when we compare magni¬ 
tudes. Is an elasticity of —3 greater or less than an elasticity of —2? From 
an algebraic point of view —3 is smaller than —2, but economists tend to 
say that the demand with the elasticity of —3 is “more elastic” than the 
one with —2. In this book we will make comparisons in terms of absolute 
value so as to avoid this kind of ambiguity. 


EXAMPLE: The Elasticity of a Linear Demand Curve 

Consider the linear demand curve, q = a — bp, depicted in Figure 15.4. 
The slope of this demand curve is a constant, —b. Plugging this into the 
formula for elasticity we have 


—bp —bp 

q a — bp 

When p = 0, the elasticity of demand is zero. When q = 0, the elasticity 
of demand is (negative) infinity. At what value of price is the elasticity of 
demand equal to —1? 
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Figure 

15.4 


PRICE 

N = “ 



a/2b 

l e l>i 

M = 1 



|e|<1 

N = ° 




a/2 

QUANTITY 

The elasticity of a linear demand curve. Elasticity is 
infinite at the vertical intercept, one halfway down the curve, 
and zero at the horizontal intercept. 


To find such a price, we write down the equation 


—bp 
a — bp 


-1 


and solve it for p. This gives 


a 


P= W 


which, as we see in Figure 15.4, is just halfway down the demand curve. 


15.6 Elasticity and Demand 

If a good has an elasticity of demand greater than 1 in absolute value we say 
that it has an elastic demand. If the elasticity is less than 1 in absolute 
value we say that it has an inelastic demand. And if it has an elasticity 
of exactly —1, we say it has unit elastic demand. 

An elastic demand curve is one for which the quantity demanded is very 
responsive to price: if you increase the price by 1 percent, the quantity 
demanded decreases by more than 1 percent. So think of elasticity as the 
responsiveness of the quantity demanded to price, and it will be easy to 
remember what elastic and inelastic mean. 

In general the elasticity of demand for a good depends to a large extent 
on how many close substitutes it has. Take an extreme case—our old friend, 
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the red pencils and blue pencils example. Suppose that everyone regards 
these goods as perfect substitutes. Then if some of each of them are bought, 
they must sell for the same price. Now think what would happen to the 
demand for red pencils if their price rose, and the price of blue pencils 
stayed constant. Clearly it would drop to zero—the demand for red pencils 
is very elastic since it has a perfect substitute. 

If a good has many close substitutes, we would expect that its demand 
curve would be very responsive to its price changes. On the other hand, if 
there are few close substitutes for a good, it can exhibit a quite inelastic 
demand. 


15.7 Elasticity and Revenue 


Revenue is just the price of a good times the quantity sold of that good. 
If the price of a good increases, then the quantity sold decreases, so revenue 
may increase or decrease. Which way it goes obviously depends on how 
responsive demand is to the price change. If demand drops a lot when the 
price increases, then revenue will fall. If demand drops only a little when the 
price increases, then revenue will increase. This suggests that the direction 
of the change in revenue has something to do with the elasticity of demand. 

Indeed, there is a very useful relationship between price elasticity and 
revenue change. The definition of revenue is 

R = pq. 

If we let the price change to p + A p and the quantity change to q + A q, we 
have a new revenue of 

R! = (p + Ap)(q + Aq) 

= pq+ qAp + pAq + ApAq. 

Subtracting R from R' we have 

A R = qAp + pAq + ApAq. 

For small values of A p and Aq , the last term can safely be neglected, leaving 
us with an expression for the change in revenue of the form 

A R = qAp + pAq. 


That is, the change in revenue is roughly equal to the quantity times the 
change in price plus the original price times the change in quantity. If we 
want an expression for the rate of change of revenue per change in price, 
we just divide this expression by A p to get 


A R 
A p 


Aq 
A p 
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Figure 

15.5 


This is treated geometrically in Figure 15.5. The revenue is just the 
area of the box: price times quantity. When the price increases, we add a 
rectangular area on the top of the box, which is approximately qAp, but 
we subtract an area on the side of the box, which is approximately pAq. 
For small changes, this is exactly the expression given above. (The leftover 
part, ApAq , is the little square in the corner of the box, which will be very 
small relative to the other magnitudes.) 



How revenue changes when price changes. The change 
in revenue is the sum of the box on the top minus the box on 
the side. 


When will the net result of these two effects be positive? That is, when 
do we satisfy the following inequality: 




Rearranging we have 


pa« > -i 

qAp 


The left-hand side of this expression is e(p), which is a negative number. 
Multiplying through by —1 reverses the direction of the inequality to give 
us: 


Kp)I < 1. 
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Thus revenue increases when price increases if the elasticity of demand 
is less than 1 in absolute value. Similarly, revenue decreases when price 
increases if the elasticity of demand is greater than 1 in absolute value. 
Another way to see this is to write the revenue change as we did above: 

A R = pAq + qAp > 0 


and rearrange this to get 


pAq 

qAp 


Kp)I < 1. 


Yet a third way to see this is to take the formula for AR/Ap and rear¬ 
range it as follows: 


AR 

Ap 


= q + p 


Aq 

Ap 


= q 


i+ p -p-' 

qAp_ 

= q [i + eb)] • 


Since demand elasticity is naturally negative, we can also write this ex¬ 
pression as 


AR 

Ap 


9[1-Kp)|]. 


In this formula it is easy to see how revenue responds to a change in price: 
if the absolute value of elasticity is greater than 1, then AR/Ap must be 
negative and vice versa. 

The intuitive content of these mathematical facts is not hard to remem¬ 
ber. If demand is very responsive to price—that is, it is very elastic—then 
an increase in price will reduce demand so much that revenue will fall. 
If demand is very unresponsive to price—it is very inelastic—then an in¬ 
crease in price will not change demand very much, and overall revenue will 
increase. The dividing line happens to be an elasticity of —1. At this point 
if the price increases by 1 percent, the quantity will decrease by 1 percent, 
so overall revenue doesn’t change at all. 


EXAMPLE: Strikes and Profits 

In 1979 the United Farm Workers called for a strike against lettuce growers 
in California. The strike was highly effective: the production of lettuce was 
cut almost in half. But the reduction in the supply of lettuce inevitably 
caused an increase in the price of lettuce. In fact, during the strike the price 
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of lettuce rose by nearly 400 percent. Since production halved and prices 
quadrupled, the net result of was almost a doubling producer profits! 2 

One might well ask why the producers eventually settled the strike. The 
answer involves short-run and long-run supply responses. Most of the let¬ 
tuce consumed in U.S. during the winter months is grown in the Imperial 
Valley. When the supply of this lettuce was drastically reduced in one 
season, there wasn’t time to replace it with lettuce from elsewhere so the 
market price of lettuce skyrocketed. If the strike had held for several sea¬ 
sons, lettuce could be planted in other regions. This increase in supply from 
other sources would tend reduce the price of lettuce back to its normal level, 
thereby reducing the profits of the Imperial Valley growers. 


15.8 Constant Elasticity Demands 

What kind of demand curve gives us a constant elasticity of demand? In 
a linear demand curve the elasticity of demand goes from zero to infinity, 
which is not exactly what you would call constant, so that’s not the answer. 

We can use the revenue calculation described above to get an example. 
We know that if the elasticity is 1 at price p , then the revenue will not 
change when the price changes by a small amount. So if the revenue remains 
constant for all changes in price, we must have a demand curve that has 
an elasticity of —1 everywhere. 

But this is easy. We just want price and quantity to be related by the 
formula 

pq = R, 

which means that 


R 



is the formula for a demand function with constant elasticity of — 1. The 
graph of the function q = R/p is given in Figure 15.6. Note that price 
times quantity is constant along the demand curve. 

The general formula for a demand with a constant elasticity of e turns 
out to be 

q = Ap e , 

where A is an arbitrary positive constant and e, being an elasticity, will 
typically be negative. This formula will be useful in some examples later 
on. 

A convenient way to express a constant elasticity demand curve is to 
take logarithms and write 

In q = In A + e In p. 


2 


See Colin Carter, et. at, “Agricultural Labor Strikes and Farmers’ Incomes,” Eco¬ 
nomic Inquiry, 25, 1987,121-133. 
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Unit elastic demand. For this demand curve price times 
quantity is constant at every point. Thus the demand curve has 
a constant elasticity of —1. 


In this expression, the logarithm of q depends in a linear way on the loga¬ 
rithm of p. 


15.9 Elasticity and Marginal Revenue 


In section 15.7 we examined how revenue changes when you change the 
price of a good, but it is often of interest to consider how revenue changes 
when you change the quantity of a good. This is especially useful when we 
are considering production decisions by firms. 

We saw earlier that for small changes in price and quantity, the change 
in revenue is given by 

A R = pAq + qAp. 


If we divide both sides of this expression by A q, we get the expression for 

marginal revenue: 

A R A p 

= P + q-T-- 


MR = 


A q 


[ A q 


There is a useful way to rearrange this formula. Note that we can also 
write this as 


A R 
A q 


= p 


1 + 


qAp 

pAq. 


Figure 

15.6 
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What is the second term inside the brackets? Nope, it’s not elasticity, but 
you’re close. It is the reciprocal of elasticity: 

1 1 qAp 

e pAq pAq' 

q^p 

Thus the expression for marginal revenue becomes 


A R 
A q 


p(q) 


i + 


e(q)\ 


(Here we’ve written p(q) and e(q) to remind ourselves that both price and 
elasticity will typically depend on the level of output.) 

When there is a danger of confusion due to the fact that elasticity is a 
negative number we will sometimes write this expression as 


A R 
A q 


p(q) 



i 

RejJ. ' 


This means that if elasticity of demand is —1, then marginal revenue 
is zero—revenue doesn’t change when you increase output. If demand is 
inelastic, then |e| is less than 1, which means l/|e| is greater than 1. Thus 
1 — 1/|e| is negative, so that revenue will decrease when you increase output. 

This is quite intuitive. If demand isn’t very responsive to price, then you 
have to cut prices a lot to increase output: so revenue goes down. This 
is all completely consistent with the earlier discussion about how revenue 
changes as we change price, since an increase in quantity means a decrease 
in price and vice versa. 


EXAMPLE: Setting a Price 

Suppose that you were in charge of setting a price for some product that 
you were producing and that you had a good estimate of the demand curve 
for that product. Let us suppose that your goal is to set a price that 
maximizes profits—revenue minus costs. Then you would never want to 
set it where the elasticity of demand was less than 1—you would never 
want to set a price where demand was inelastic. 

Why? Consider what would happen if you raised your price. Then your 
revenues would increase—since demand was inelastic—and the quantity 
you were selling would decrease. But if the quantity sold decreases, then 
your production costs must also decrease, or at least, they can’t increase. 
So your overall profit must rise, which shows that operating at an inelastic 
part of the demand curve cannot yield maximal profits. 
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15.10 Marginal Revenue Curves 

We saw in the last section that marginal revenue is given by 


A R 
A q 


p{q ) + 


A p(q) 
Aq q 


or 


A R 
A q 


p(q) 


k(«)|J ’ 


We will find it useful to plot these marginal revenue curves. First, note 
that when quantity is zero, marginal revenue is just equal to the price. 
For the first unit of the good sold, the extra revenue you get is just the 
price. But after that, the marginal revenue will be less than the price, since 
Ap/Aq is negative. 

Think about it. If you decide to sell one more unit of output, you will 
have to decrease the price. But this reduction in price reduces the revenue 
you receive on all the units of output that you were selling already. Thus 
the extra revenue you receive will be less than the price that you get for 
selling the extra unit. 

Let’s consider the special case of the linear (inverse) demand curve: 


P{q) = a — bq. 


Here it is easy to see that the slope of the inverse demand curve is constant: 


A p 
A ~q 


= -b. 


Thus the formula for marginal revenue becomes 


A R 
A q 


p{q) + 


A p(q) 
Aq q 


= P{q) ~ bq 
— a — bq — bq 
= a — 2 bq. 


This marginal revenue curve is depicted in Figure 15.7A. The marginal 
revenue curve has the same vertical intercept as the demand curve, but has 
twice the slope. Marginal revenue is negative when q > a/2b. The quantity 
a/2 b is the quantity at which the elasticity is equal to —1. At any larger 
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Figure 

15.7 
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quantity demand will be inelastic, which implies that marginal revenue is 
negative. 

The constant elasticity demand curve provides another special case of 
the marginal revenue curve. (See Figure 15.7B.) If the elasticity of demand 
is constant at e(q) = e, then the marginal revenue curve will have the form 


MR = p(q) 



Since the term in brackets is constant, the marginal revenue curve is some 
constant fraction of the inverse demand curve. When |e| = 1, the marginal 
revenue curve is constant at zero. When |e| > 1, the marginal revenue curve 
lies below the inverse demand curve, as depicted. When |e| < 1, marginal 
revenue is negative. 


15.11 Income Elasticity 

Recall that the price elasticity of demand is defined as 

% change in quantity demanded 

price elasticity of demand = -— - ; — : — ; -. 

Vo change m price 

This gives us a unit-free measure of how the amount demanded responds 
to a change in price. 
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The income elasticity of demand is used to describe how the quantity 
demanded responds to a change in income; its definition is 

... r % change in quantity 

income elasticity oi demand = — - ; — ; - 

% change m income 

Recall that a normal good is one for which an increase in income leads 
to an increase in demand; so for this sort of good the income elasticity 
of demand is positive. An inferior good is one for which an increase in 
income leads to a decrease in demand; for this sort of good, the income 
elasticity of demand is negative. Economists sometimes use the term lux¬ 
ury goods. These are goods that have an income elasticity of demand 
that is greater than 1: a 1 percent increase in income leads to more than 
a 1 percent increase in demand for a luxury good. 

As a general rule of thumb, however, income elasticities tend to clus¬ 
ter around 1. We can see the reason for this by examining the budget 
constraint. Write the budget constraints for two different levels of income: 


Pix[ + p 2 x 2 = m! 
P\x\ + p 2 x° = m°. 


Subtract the second equation from the first and let A denote differences, 
as usual: 

Pi Axi + p 2 Ax 2 = Am. 

Now multiply and divide price i by Xi/Xi and divide both sides by m: 

P\X\ Ax\ p 2 x 2 Ax 2 Am 

m x i m x 2 m 


Finally, divide both sides by A m/m, and use s,; = ptXi/m to denote the 
expenditure share of good i. This gives us our final equation, 

Axi/xi Ax 2 /x 2 

Sl N—7-^ S2 A —7— — 

Am/m Am/m 


This equation says that the weighted average of the income elasticities is 
1, where the weights are the expenditure shares. Luxury goods that have 
an income elasticity greater than 1 must be counterbalanced by goods that 
have an income elasticity less than 1, so that “on average” income elastic¬ 
ities are about 1. 


Summary 

1. The market demand curve is simply the sum of the individual demand 


curves. 
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2. The reservation price measures the price at which a consumer is just 
indifferent between purchasing or not purchasing a good. 

3. The demand function measures quantity demanded as a function of 
price. The inverse demand function measures price as a function of quan¬ 
tity. A given demand curve can be described in either way. 

4. The elasticity of demand measures the responsiveness of the quantity 
demanded to price. It is formally defined as the percent change in quantity 
divided by the percent change in price. 

5. If the absolute value of the elasticity of demand is less than 1 at some 
point, we say that demand is inelastic at that point. If the absolute value 
of elasticity is greater than 1 at some point, we say demand is elastic at 
that point. If the absolute value of the elasticity of demand at some point 
is exactly 1, we say that the demand has unitary elasticity at that point. 

6. If demand is inelastic at some point, then an increase in quantity will 
result in a reduction in revenue. If demand is elastic, then an increase in 
quantity will result in an increase in revenue. 

7. The marginal revenue is the extra revenue one gets from increasing 
the quantity sold. The formula relating marginal revenue and elasticity 
is MR = p[ 1 + 1/e] = p[ 1 — 1/|e|]. 

8. If the inverse demand curve is a linear function p(q) = a — bq, then the 
marginal revenue is given by MR = a — 2 bq. 

9. Income elasticity measures the responsiveness of the quantity demanded 
to income. It is formally defined as the percent change in quantity divided 
by the percent change in income. 


REVIEW QUESTIONS 


1. If the market demand curve is D(p) = 100 — .5p, what is the inverse 
demand curve? 

2. An addict’s demand function for a drug may be very inelastic, but the 
market demand function might be quite elastic. How can this be? 

3. If D(p) = 12 — 2p, what price will maximize revenue? 

4. Suppose that the demand curve for a good is given by D(p) = 100/p. 
What price will maximize revenue? 

5. True or false? In a two good model if one good is an inferior good the 
other good must be a luxury good. 
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APPENDIX 


In terms of derivatives the price elasticity of demand is defined by 

= Pdq 

q dp' 

In the text we claimed that the formula for a constant elasticity demand curve 
was q = Ap e . To verify that this is correct, we can just differentiate it with 
respect to price: 

dq . e —1 
— = eAp 
dp 

and multiply by price over quantity: 

Pdq p e -l 

- — — —eAp = e. 

q dp Ap e 

Everything conveniently cancels, leaving us with e as required. 

A linear demand curve has the formula q(p) = a —bp. The elasticity of demand 
at a point p is given by 

p dq —bp 

q dp a — bp 

When p is zero, the elasticity is zero. When q is zero, the elasticity is infinite. 

Revenue is given by R(p) = pq(p). To see how revenue changes as p changes 
we differentiate revenue with respect to p to get 

R’(p) =pq(p) + q{p). 

Suppose that revenue increases when p increases. Then we have 

R'(P ) =P < T' +i{p) > 0. 


Rearranging, we have 


pdq 
q dp 


- 1 . 


Recalling that dq/dp is negative and multiplying through by —1, we find 


M < i- 


Hence if revenue increases when price increases, we must be at an inelastic part 
of the demand curve. 
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Figure 
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EXAMPLE: The Laffer Curve 

In this section we’ll consider some simple elasticity calculations that can be used 
to examine an issue of considerable policy interest, namely, how tax revenue 
changes when the tax rate changes. 

Suppose that we graph tax revenue versus the tax rate. If the tax rate is zero, 
then tax revenues are zero; if the tax rate is 1, nobody will want to demand 
or supply the good in question, so the tax revenue is also zero. Thus revenue 
as a function of the tax rate must first increase and eventually decrease. (Of 
course, it can go up and down several times between zero and 1, but we’ll ignore 
this possibility to keep things simple.) The curve that relates tax rates and tax 
revenues is known as the Laffer curve, depicted in Figure 15.8. 

The interesting feature of the Laffer curve is that it suggests that when the tax 
rate is high enough, an increase in the tax rate will end up reducing the revenues 
collected. The reduction in the supply of the good due to the increase in the tax 
rate can be so large that tax revenue actually decreases. This is called the Laffer 
effect, after the economist who popularized this diagram in the early eighties. It 
has been said that the virtue of the Laffer curve is that you can explain it to a 
congressman in half an hour and he can talk about it for six months. Indeed, 
the Laffer curve figured prominently in the debate over the effect of the 1980 tax 
cuts. The catch in the above argument is the phrase “high enough.” Just how 
high does the tax rate have to be for the Laffer effect to work? 

To answer this question let’s consider the following simple model of the labor 
market. Suppose that firms will demand zero labor if the wage is greater than 
w and an arbitrarily large amount of labor if the wage is exactly TO. This means 
that the demand curve for labor is flat at some wage w. Suppose that the supply 
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curve of labor, S(p), has a conventional upward slope. The equilibrium in the 
labor market is depicted in Figure 15.9. 
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Labor market. Equilibrium in the labor market with a horizontal 
demand curve for labor. When labor income is taxed, less will be 
supplied at each wage rate. 


Figure 

15.9 


If we put a tax on labor at the rate t, then if the firm pays w, the worker 
only gets w = (1 — t)w. Thus the supply curve of labor tilts to the left, and the 
amount of labor sold drops, as in Figure 15.9. The after-tax wage has gone down 
and this has discouraged the sale of labor. So far so good. 

Tax revenue, T, is therefore given by the formula 


T = twS(w), 


where w = (1 — t)w and S(w) is the supply of labor. 

In order to see how tax revenue changes as we change the tax rate we differ¬ 
entiate this formula with respect to t to find 


dT 

dt 



dS(w) 

dw 


w + S(w) 


w. 


(15.1) 


(Note the use of the chain rule and the fact that dw/dt = —w.) 

The Laffer effect occurs when revenues decline when t increases—that is, when 
this expression is negative. Now this clearly means that the supply of labor is 
going to have to be quite elastic- it has to drop a lot when the tax increases. So 
let’s try to see what values of elasticity will make this expression negative. 
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In order for equation (15.1) to be negative, we must have 

_ t dS(w)_ + < 0 

aw 

Transposing yields 

,dS(w)_ , 

t ’ w > S(w), 
aw 

and dividing both sides by tS(w) gives 

dS(w) w 1 

dw S(w) t 

Multiplying both sides by (1 — t) and using the fact that w = (1 — t)w gives us 

dS w 1 — t 
dw S > t 

The left-hand side of this expression is the elasticity of labor supply. We have 
shown that the Laffer effect can only occur if the elasticity of labor supply is 
greater than (1 — t)/t. 

Let us take an extreme case and suppose that the tax rate on labor income 
is 50 percent. Then the Laffer effect can occur only when the elasticity of labor 
supply is greater than 1. This means that a 1 percent reduction in the wage 
would lead to more than a 1 percent reduction in the labor supply. This is a very 
large response. 

Econometricians have often estimated labor-supply elasticities, and about the 
largest value anyone has ever found has been around 0.2. So the Laffer effect 
seems pretty unlikely for the kinds of tax rates that we have in the United States. 
However, in other countries, such as Sweden, tax rates go much higher, and there 
is some evidence that the Laffer phenomenon may have occurred. 3 


EXAMPLE: Another Expression for Elasticity 

Here is another expression for elasticity that is sometimes useful. It turns out 
that elasticity can also be expressed as 

din Q 
dlnP 

The proof involves repeated application of the chain rule. We start by noting 
that 


d In Q d In Q dQ 

d In P dQ d In P 

1 dQ 
Q d In P 


(15.2) 


3 


See Charles E. Stuart, “Swedish Tax Rates, Labor Supply, and Tax Revenues,” Jour¬ 
nal of Political Economy , 89, 5 (October 1981), 1020-38. 
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We also note that 

dQ dQ d In P 
dP = din P dP 
dQ 1 
~ dlnPP’ 

which implies that 

dQ _ dQ 
dlnP _ P dP ' 

Substituting this into equation (15.2), we have 

d in Q 1 dQ 
dlnP = QdP P = £ ’ 

which is what we wanted to establish. 

Thus elasticity measures the slope of the demand curve plotted on log-log 
paper: how the log of the quantity changes as the log of the price changes. 






CHAPTER 



EQUILIBRIUM 


In preceding chapters we have seen how to construct individual demand 
curves by using information about preferences and prices. In Chapter 15 
we added up these individual demand curves to construct market demand 
curves. In this chapter we will describe how to use these market demand 
curves to determine the equilibrium market price. 

In Chapter 1 we said that there were two fundamental principles of micro- 
economic analysis. These were the optimization principle and the equilib¬ 
rium principle. Up until now we have been studying examples of the opti¬ 
mization principle: what follows from the assumption that people choose 
their consumption optimally from their budget sets. In later chapters we 
will continue to use optimization analysis to study the profit-maximization 
behavior of firms. Finally, we combine the behavior of consumers and firms 
to study the equilibrium outcomes of their interaction in the market. 

But before undertaking that study in detail it seems worthwhile at this 
point to give some examples of equilibrium analysis—how the prices adjust 
so as to make the demand and supply decisions of economic agents com¬ 
patible. In order to do so, we will have to briefly consider the other side of 
the market—the supply side. 
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16.1 Supply 

We have already seen a few examples of supply curves. In Chapter 1 
we looked at a vertical supply curve for apartments. In Chapter 9 we 
considered situations where consumers would choose to be net suppliers 
or demanders of goods that they owned, and we analyzed labor-supply 
decisions. 

In all of these cases the supply curve simply measured how much the 
consumer was willing to supply of a good at each possible market price. 
Indeed, this is the definition of the supply curve: for each p , we determine 
how much of the good will be supplied, S(p). In the next few chapters we 
will discuss the supply behavior of firms. However, for many purposes, it is 
not really necessary to know where the supply curve or the demand curve 
comes from in terms of the optimizing behavior that generates the curves. 
For many problems the fact that there is a functional relationship between 
the price and the quantity that consumers want to demand or supply at 
that price is enough to highlight important insights. 


16.2 Market Equilibrium 

Suppose that we have a number of consumers of a good. Given their 
individual demand curves we can add them up to get a market demand 
curve. Similarly, if we have a number of independent suppliers of this 
good, we can add up their individual supply curves to get the market 
supply curve. 

The individual demanders and suppliers are assumed to take prices as 
given—outside of their control—and simply determine their best response 
given those market prices. A market where each economic agent takes 
the market price as outside of his or her control is called a competitive 
market. 

The usual justification for the competitive-market assumption is that 
each consumer or producer is a small part of the market as a whole and 
thus has a negligible effect on the market price. For example, each supplier 
of wheat takes the market price to be more or less independent of his actions 
when he determines how much wheat he wants to produce and supply to 
the market. 

Although the market price may be independent of any one agent’s actions 
in a competitive market, it is the actions of all the agents together that 
determine the market price. The equilibrium price of a good is that 
price where the supply of the good equals the demand. Geometrically, this 
is the price where the demand and the supply curves cross. 

If we let D(p) be the market demand curve and S(p) the market supply 
curve, the equilibrium price is the price p* that solves the equation 

D(p*) = S(p*). 
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The solution to this equation, p*, is the price where market demand equals 
market supply. 

Why should this be an equilibrium price? An economic equilibrium 
is a situation where all agents are choosing the best possible action for 
themselves and each person’s behavior is consistent with that of the others. 
At any price other than an equilibrium price, some agents’ behaviors would 
be infeasible, and there would therefore be a reason for their behavior to 
change. Thus a price that is not an equilibrium price cannot be expected to 
persist since at least some agents would have an incentive to change their 
behavior. 

The demand and supply curves represent the optimal choices of the 
agents involved, and the fact that they are equal at some price p* indi¬ 
cates that the behaviors of the demanders and suppliers are compatible. 
At any price other than the price where demand equals supply these two 
conditions will not be met. 

For example, suppose that we consider some price p' < p* where demand 
is greater than supply. Then some suppliers will realize that they can sell 
their goods at more than the going price p' to the disappointed demanders. 
As more and more suppliers realize this, the market price will be pushed 
up to the point where demand and supply are equal. 

Similarly if p' > p*, so that demand is less than supply, then some 
suppliers will not be able to sell the amount that they expected to sell. 
The only way in which they will be able to sell more output will be to offer 
it at a lower price. But if all suppliers are selling the identical goods, and if 
some supplier offers to sell at a lower price, the other suppliers must match 
that price. Thus excess supply exerts a downward pressure on the market 
price. Only when the amount that people want to buy at a given price 
equals the amount that people want to sell at that price will the market be 
in equilibrium. 


16.3 Two Special Cases 


There are two special cases of market equilibrium that are worth mentioning 
since they come up fairly often. The first is the case of fixed supply. Here 
the amount supplied is some given number and is independent of price; 
that is, the supply curve is vertical. In this case the equilibrium quantity 
is determined entirely by the supply conditions and the equilibrium price 
is determined entirely by demand conditions. 

The opposite case is the case where the supply curve is completely hor¬ 
izontal. If an industry has a perfectly horizontal supply curve, it means 
that the industry will supply any amount of a good at a constant price. In 
this situation the equilibrium price is determined by the supply conditions, 
while the equilibrium quantity is determined by the demand curve. 
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The two cases are depicted in Figure 16.1. In these two special cases the 
determination of price and quantity can be separated; but in the general 
case the equilibrium price and the equilibrium quantity are jointly deter¬ 
mined by the demand and supply curves. 
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Special cases of equilibrium. Case A shows a vertical supply 
curve where the equilibrium price is determined solely by the 
demand curve. Case B depicts a horizontal supply curve where 
the equilibrium price is determined solely by the supply curve. 


16.4 Inverse Demand and Supply Curves 

We can look at market equilibrium in a slightly different way that is of¬ 
ten useful. As indicated earlier, individual demand curves are normally 
viewed as giving the optimal quantities demanded as a function of the 
price charged. But we can also view them as inverse demand functions 
that measure the price that someone is willing to pay in order to acquire 
some given amount of a good. The same thing holds for supply curves. 
They can be viewed as measuring the quantity supplied as a function of 
the price. But we can also view them as measuring the price that must 
prevail in order to generate a given amount of supply. 

These same constructions can be used with market demand and market 
supply curves, and the interpretations are just those given above. In this 
framework an equilibrium price is determined by finding that quantity at 


Figure 

16.1 
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which the amount the demanders are willing to pay to consume that quan¬ 
tity is the same as the price that suppliers must receive in order to supply 
that quantity. 

Thus, if we let Ps(q ) be the inverse supply function and Po(q) be the 
inverse demand function, equilibrium is determined by the condition 

P S (q*) = P D (q*). 


EXAMPLE: Equilibrium with Linear Curves 

Suppose that both the demand and the supply curves are linear: 

D(p ) = a — bp 
S(p) = c + dp. 

The coefficients (a, 6, c, d ) are the parameters that determine the inter¬ 
cepts and slopes of these linear curves. The equilibrium price can be found 
by solving the following equation: 

D(p) = a — bp = c + dp = S(p). 


The answer is 


a — c 
d + b 


The equilibrium quantity demanded (and supplied) is 


D(p*) = a — bp* 


ad + be 
= b+d ' 

We can also solve this problem by using the inverse demand and supply 
curves. First we need to find the inverse demand curve. At what price is 
some quantity q demanded? Simply substitute q for D(p) and solve for p. 
We have 

q = a — bp, 


so 

p D(q) 


In the same manner we find 


a — q 


b 


Ps(q) 


q-c 

d 
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Setting the demand price equal to the supply price and solving for the 
equilibrium quantity we have 


* ad + bc 


Note that this gives the same answer as in the original problem for both 
the equilibrium price and the equilibrium quantity. 


16.5 Comparative Statics 

After we have found an equilibrium by using the demand equals supply 
condition (or the demand price equals the supply price condition), we can 
see how it will change as the demand and supply curves change. For ex¬ 
ample, it is easy to see that if the demand curve shifts to the right in a 
parallel way—some fixed amount more is demanded at every price—the 
equilibrium price and quantity must both rise. On the other hand, if the 
supply curve shifts to the right, the equilibrium quantity rises, but the 
equilibrium price must fall. 

What if both curves shift to the right? Then the quantity will definitely 
increase while the change in price is ambiguous—it could increase or it 
could decrease. 


EXAMPLE: Shifting Both Curves 

Question: Consider the competitive market for apartments described in 
Chapter 1. Let the equilibrium price in that market be p* and the equi¬ 
librium quantity be q*. Suppose that a developer converts to of the apart¬ 
ments to condominiums, which are bought by the people who are currently 
living in the apartments. What happens to the equilibrium price? 

Answer: The situation is depicted in Figure 16.2. The demand and sup¬ 
ply curves both shift to the left by the same amount. Hence the price is 
unchanged and the quantity sold simply drops by m. 

Algebraically the new equilibrium price is determined by 

D(p ) — m = S(p) — to, 

which clearly has the same solution as the original demand equals supply 
condition. 
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Figure 

16.2 



Shifting both curves. Both demand and supply curves shift 
to the left by the same amount, which implies the equilibrium 
price will remain unchanged. 


16.6 Taxes 

Describing a market before and after taxes are imposed presents a very nice 
exercise in comparative statics, as well as being of considerable interest in 
the conduct of economic policy. Let us see how it is done. 

The fundamental thing to understand about taxes is that when a tax is 
present in a market, there are two prices of interest: the price the demander 
pays and the price the supplier gets. These two prices—the demand price 
and the supply price—differ by the amount of the tax. 

There are several different kinds of taxes that one might impose. Two 
examples we will consider here are quantity taxes and value taxes (also 
called ad valorem taxes). 

A quantity tax is a tax levied per unit of quantity bought or sold. Gaso¬ 
line taxes are a good example of this. The gasoline tax is roughly 12 cents 
a gallon. If the demander is paying Pd = $1.50 per gallon of gasoline, the 
supplier is getting Pg = $1.50 — .12 = $1.38 per gallon. In general, if t is 
the amount of the quantity tax per unit sold, then 

Pd = Ps + t- 

A value tax is a tax expressed in percentage units. State sales taxes are 
the most common example of value taxes. If your state has a 5 percent 
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sales tax, then when you pay $1.05 for something (including the tax), the 
supplier gets $1.00. In general, if the tax rate is given by r, then 

P D = (l + r)P s . 

Let us consider what happens in a market when a quantity tax is im¬ 
posed. For our first case we suppose that the supplier is required to pay 
the tax, as in the case of the gasoline tax. Then the amount supplied will 
depend on the supply price—the amount the supplier actually gets after 
paying the tax—and the amount demanded will depend on the demand 
price—the amount that the demander pays. The amount that the supplier 
gets will be the amount the demander pays minus the amount of the tax. 
This gives us two equations: 


D{P d ) = S(P s ) 

Ps = Pd ~ t- 

Substituting the second equation into the first, we have the equilibrium 
condition: 

D(P D ) = S(P D -t). 

Alternatively we could also rearrange the second equation to get Pd = 
Ps + t and then substitute to find 

D(P s + t) = S(P s ). 

Either way is equally valid; which one you use will depends on convenience 
in a particular case. 

Now suppose that instead of the supplier paying the tax, the demander 
has to pay the tax. Then we write 


Pd —1 = Ps, 

which says that the amount paid by the demander minus the tax equals the 
price received by the supplier. Substituting this into the demand equals 
supply condition we find 


D{P d ) = S(P D — t). 

Note that this is the same equation as in the case where the supplier 
pays the tax. As far as the equilibrium price facing the demanders and 
the suppliers is concerned, it really doesn’t matter who is responsible for 
paying the tax—it just matters that the tax must be paid by someone. 

This really isn’t so mysterious. Think of the gasoline tax. There the tax 
is included in the posted price. But if the price were instead listed as the 
before-tax price and the gasoline tax were added on as a separate item to 
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be paid by the demanders, then do you think that the amount of gasoline 
demanded would change? After all, the final price to the consumers would 
be the same whichever way the tax was charged. Insofar as the consumers 
can recognize the net cost to them of goods they purchase, it really doesn’t 
matter which way the tax is levied. 

There is an even simpler way to show this using the inverse demand and 
supply functions. The equilibrium quantity traded is that quantity q* such 
that the demand price at q* minus the tax being paid is just equal to the 
supply price at q*. In symbols: 

P D (q*)-t=P S (q*)- 

If the tax is being imposed on the suppliers, then the condition is that 
the supply price plus the amount of the tax must equal the demand price: 

P D (q*) = P s (q*) + t. 

But these are the same equations, so the same equilibrium prices and 
quantities must result. 

Finally, we consider the geometry of the situation. This is most easily 
seen by using the inverse demand and supply curves discussed above. We 
want to find the quantity where the curve Pn{q) — t crosses the curve Ps(q)- 
In order to locate this point we simply shift the demand curve down by t and 
see where this shifted demand curve intersects the original supply curve. 
Alternatively we can find the quantity where Po{q) equals Ps{q) + t. To do 
this, we simply shift the supply curve up by the amount of the tax. Either 
way gives us the correct answer for the equilibrium quantity. The picture 
is given in Figure 16.3. 

From this diagram we can easily see the qualitative effects of the tax. 
The quantity sold must decrease, the price paid by the demanders must go 
up, and the price received by the suppliers must go down. 

Figure 16.4 depicts another way to determine the impact of a tax. Think 
about the definition of equilibrium in this market. We want to find a 
quantity q* such that when the supplier faces the price p s and the demander 
faces the price pd = p s + t, the quantity q* is demanded by the demander 
and supplied by the supplier. Let us represent the tax t by a vertical line 
segment and slide it along the supply curve until it just touches the demand 
curve. That point is our equilibrium quantity! 


EXAMPLE: Taxation with Linear Demand and Supply 

Suppose that the demand and supply curves are both linear. Then if we 
impose a tax in this market, the equilibrium is determined by the equations 


a — bp D = c + dps 
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The imposition of a tax. In order to study the impact of 
a tax, we can either shift the demand curve down, as in panel 
A, or shift the supply curve up, as in panel B. The equilibrium 
prices paid by the demanders and received by the suppliers will 
be the same either way. 


and 

Pd = Ps + t. 

Substituting from the second equation into the first, we have 

a — b(ps + t) = c + dps. 

Solving for the equilibrium supply price, pg, gives 

* a — c — bt 
Ps = 


d + b ' 


The equilibrium demand price, p* * D , is then given by p* s + t: 

a — c — bt 


Pd = 


d + b 
a — c + dt 
d+b 


+ t 


Note that the price paid by the demander increases and the price received 
by the supplier decreases. The amount of the price change depends on the 
slope of the demand and supply curves. 


Figure 

16.3 
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Figure 

16.4 


Amount 
of tax 



Another way to determine the impact of a tax. Slide 
the line segment along the supply curve until it hits the demand 
curve. 


16.7 Passing Along a Tax 

One often hears about how a tax on producers doesn’t hurt profits, since 
firms can simply pass along a tax to consumers. As we’ve seen above, a tax 
really shouldn’t be regarded as a tax on firms or on consumers. Rather, 
taxes are on transactions between firms and consumers. In general, a tax 
will both raise the price paid by consumers and lower the price received by 
firms. How much of a tax gets passed along will therefore depend on the 
characteristics of demand and supply. 

This is easiest to see in the extreme cases: when we have a perfectly 
horizontal supply curve or a perfectly vertical supply curve. These are also 
known as the case of perfectly elastic and perfectly inelastic supply. 

We’ve already encountered these two special cases earlier in this chapter. 
If an industry has a horizontal supply curve, it means that the industry will 
supply any amount desired of the good at some given price, and zero units 
of the good at any lower price. In this case the price is entirely determined 
by the supply curve and the quantity sold is determined by demand. If 
an industry has a vertical supply curve, it means that the quantity of the 
good is fixed. The equilibrium price of the good is determined entirely by 
demand. 

Let’s consider the imposition of a tax in a market with a perfectly elastic 
supply curve. As we’ve seen above, imposing a tax is just like shifting the 
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Special cases of taxation. (A) In the case of a perfectly 
elastic supply curve the tax gets completely passed along to the 
consumers. (B) In the case of a perfectly inelastic supply none 
of the tax gets passed along. 


Figure 

16.5 


supply curve up by the amount of the tax, as illustrated in Figure 16.5A. 

In the case of a perfectly elastic supply curve it is easy to see that the 
price to the consumers goes up by exactly the amount of the tax. The 
supply price is exactly the same as it was before the tax, and the demanders 
end up paying the entire tax. When you think about the meaning of the 
horizontal supply curve, this is not hard to understand. The horizontal 
supply curve means that the industry is willing to supply any amount of 
the good at some particular price, p*, and zero amount at any lower price. 
Thus, if any amount of the good is going to be sold at all in equilibrium, 
the suppliers must receive p* for selling it. This effectively determines the 
equilibrium supply price, and the demand price is p* +1. 

The opposite case is illustrated in Figure 16.5B. If the supply curve is 
vertical and we “shift the supply curve up,” we don’t change anything in 
the diagram. The supply curve just slides along itself, and we still have 
the same amount of the good supplied, with or without the tax. In this 
case, the demanders determine the equilibrium price of the good, and they 
are willing to pay a certain amount, p*, for the supply of the good that is 
available, tax or no tax. Thus they end up paying p*, and the suppliers 
end up receiving p* — t. The entire amount of the tax is paid by the 
suppliers. 

This case often strikes people as paradoxical, but it really isn’t. If the 
suppliers could raise their prices after the tax is imposed and still sell their 
entire fixed supply, they would have raised their prices before the tax was 
imposed and made more money! If the demand curve doesn’t move, then 
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Figure 

16.6 


the only way the price can increase is if the supply is reduced. If a policy 
doesn’t change either supply or demand, it certainly can’t affect price. 

Now that we understand the special cases, we can examine the in-between 
case where the supply curve has an upward slope but is not perfectly ver¬ 
tical. In this situation, the amount of the tax that gets passed along will 
depend on the steepness of the supply curve relative to the demand curve. 
If the supply curve is nearly horizontal, nearly all of the tax gets passed 
along to the consumers, while if the supply curve is nearly vertical, almost 
none of the tax gets passed along. See Figure 16.6 for some examples. 
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Passing along a tax. (A) If the supply curve is nearly hori¬ 
zontal, much of the tax can be passed along. (B) If it is nearly 
vertical, very little of the tax can be passed along. 


16.8 The Deadweight Loss of a Tax 

We’ve seen that taxing a good will typically increase the price paid by the 
demanders and decrease the price received by the suppliers. This certainly 
represents a cost to the demanders and suppliers, but from the economist’s 
viewpoint, the real cost of the tax is that the output has been reduced. 

The lost output is the social cost of the tax. Let us explore the social 
cost of a tax using the consumers’ and producers’ surplus tools developed 
in Chapter 14. We start with the diagram given in Figure 16.7. This 
depicts the equilibrium demand price and supply price after a tax, t, has 
been imposed. 
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Output has been decreased by this tax, and we can use the tools of 
consumers’ and producers’ surplus to value the social loss. The loss in 
consumers’ surplus is given by the areas A + B, and the loss in producers’ 
surplus is given in areas C + D. These are the same kind of losses that we 
examined in Chapter 14. 



The deadweight loss of a tax. The area B + D measures 
the deadweight loss of the tax. 


Since we’re after an expression for the social cost of the tax, it seems 
sensible to add the areas A + B and C+D to each other to get the total loss 
to the consumers and to the producers of the good in question. However, 
we’ve still left out one party—namely, the government. 

The government gains revenue from the tax. And, of course, the con¬ 
sumers who benefit from the government services provided with these tax 
revenues also gain from the tax. We can’t really say how much they gain 
until we know what the tax revenues will be spent on. 

Let us make the assumption that the tax revenues will just be handed 
back to the consumers and the producers, or equivalently that the services 
provided by the government revenues will be just equal in value to the 
revenues spent on them. 

Then the net benefit to the government is the area A + C —the total 
revenue from the tax. Since the loss of producers’ and consumers’ surpluses 
are net costs, and the tax revenue to the government is a net benefit, the 
total net cost of the tax is the algebraic sum of these areas: the loss in 


Figure 

16.7 
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consumers’ surplus, —( A + B ), the loss in producers’ surplus, — (C + D), 
and the gain in government revenue, + {A + C). 

The net result is the area —(B + D). This area is known as the dead¬ 
weight loss of the tax or the excess burden of the tax. This latter phrase 
is especially descriptive. 

Recall the interpretation of the loss of consumers’ surplus. It is how 
much the consumers would pay to avoid the tax. In terms of this diagram 
the consumers are willing to pay A + B to avoid the tax. Similarly, the 
producers are willing to pay C + D to avoid the tax. Together they are 
willing to pay A + B + C + D to avoid a tax that raises A + C dollars of 
revenue. The excess burden of the tax is therefore B + D. 

What is the source of this excess burden? Basically it is the lost value to 
the consumers and producers due to the reduction in the sales of the good. 
You can’t tax what isn’t there. 1 So the government doesn’t get any revenue 
on the reduction in sales of the good. From the viewpoint of society, it is 
a pure loss—a deadweight loss. 

We could also derive the deadweight loss directly from its definition, by 
just measuring the social value of the lost output. Suppose that we start 
at the old equilibrium and start moving to the left. The first unit lost was 
one where the price that someone was willing to pay for it was just equal 
to the price that someone was willing to sell it for. Here there is hardly 
any social loss since this unit was the marginal unit that was sold. 

Now move a little farther to the left. The demand price measures how 
much someone was willing to pay to receive the good, and the supply price 
measures the price at which someone was willing to supply the good. The 
difference is the lost value on that unit of the good. If we add this up over 
the units of the good that are not produced and consumed because of the 
presence of the tax, we get the deadweight loss. 


EXAMPLE: The Market for Loans 

The amount of borrowing or lending in an economy is influenced to a large 
degree by the interest rate charged. The interest rate serves as a price in 
the market for loans. 

We can let D{r) be the demand for loans by borrowers and S(r) be 
the supply of loans by lenders. The equilibrium interest rate, r*, is then 
determined by the condition that demand equal supply: 

D[r*) = S(r*). (16.1) 

Suppose we consider adding taxes to this model. What will happen to 
the equilibrium interest rate? 

1 At least the government hasn’t figured out how to do this yet. But they’re working 
on it. 
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In the U.S. economy individuals have to pay income tax on the interest 
they earn from lending money. If everyone is in the same tax bracket, t , 
the after-tax interest rate facing lenders will be (1 — t)r. Thus the supply 
of loans, which depends on the after-tax interest rate, will be £((1 — t)r). 

On the other hand, the Internal Revenue Service code allows many bor¬ 
rowers to deduct their interest charges, so if the borrowers are in the same 
tax bracket as the lenders, the after-tax interest rate they pay will be 
(1 — t)r. Hence the demand for loans will be D((l — t)r). The equation for 
interest rate determination with taxes present is then 

D{(l-t)r') = S{{l-t)r'). (16.2) 

Now observe that if r* solves equation (16.1), then r* = (1 — t)r' must 
solve equation (16.2) so that 


r* = (1 — t)r ', 


or 


Thus the interest rate in the presence of the tax will be higher by 1/(1— t). 
The after-tax interest rate (1 — t)r' will be r*, just as it was before the tax 
was imposed! 

Figure 16.8 may make things clearer. Making interest income taxable 
will tilt the supply curve for loans up by a factor of 1/(1 — t); but making 
interest payments tax deductible will also tilt the demand curve for loans 
up by 1/(1 — t). The net result is that the market interest rate rises by 
precisely 1/(1 — t). 

Inverse demand and supply functions provide another way to look at this 
problem. Let rb{q) be the inverse demand function for borrowers. This tells 
us what the after-tax interest rate would have to be to induce people to 
borrow q. Similarly, let ri(q) be the inverse supply function for lenders. 
The equilibrium amount lent will then be determined by the condition 

n{q*)=n{q*)- (16.3) 

Now introduce taxes into the situation. To make things more interesting, 
we’ll allow borrowers and lenders to be in different tax brackets, denoted 
by 4 and 4- If the market interest rate is r, then the after-tax rate facing 
borrowers will be (1 — tb)r, and the quantity they choose to borrow will be 
determined by the equation 


(1 - t b )r = r b {q) 


n{g) 

1-4' 


or 


r = 


(16.4) 
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Figure 

16.8 



Equilibrium in the loan market. If borrowers and lenders 
are in the same tax bracket, the after-tax interest rate and the 
amount borrowed are unchanged. 


Similarly, the after-tax rate facing lenders will be (1 — ti)r, and the amount 
they choose to lend will be determined by the equation 

(1 - t{)r = n{q) 


or 


ri(q) 

r = --. 

1 -tl 


(16.5) 


Combining equations (16.4) and (16.5) gives the equilibrium condition: 


n(q) = ri(q) 

1 - t b 1 - ti ‘ 


(16.6) 


From this equation it is easy to see that if borrowers and lenders are in the 
same tax bracket, so that 4 = 4, then q = q*. What if they are in different 
tax brackets? It is not hard to see that the tax law is subsidizing borrowers 
and taxing lenders, but what is the net effect? If the borrowers face a 
higher price than the lenders, then the system is a net tax on borrowing, 
but if the borrowers face a lower price than the lenders, then it is a net 
subsidy. Rewriting the equilibrium condition, equation (16.6), we have 


n{q) 


1-4 
1 -ti 


ri(q)- 
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Thus borrowers will face a higher price than lenders if 


1-4 

1 -ti 


> 1 , 


which means that ti > 4- So if the tax bracket of lenders is greater than 
the tax bracket of borrowers, the system is a net tax on borrowing, but if 
ti < 4, it is a net subsidy. 


EXAMPLE: Food Subsidies 

In years when there were bad harvests in nineteenth-century England the 
rich would provide charitable assistance to the poor by buying up the har¬ 
vest, consuming a fixed amount of the grain, and selling the remainder to 
the poor at half the price they paid for it. At first thought this seems like 
it would provide significant benefits to the poor, but on second thought, 
doubts begin to arise. 

The only way that the poor can be made better off is if they end up 
consuming more grain. But there is a fixed amount of grain available after 
the harvest. So how can the poor be better off because of this policy? 

As a matter of fact they are not; the poor end up paying exactly the 
same price for the grain with or without the policy. To see why, we will 
model the equilibrium with and without this program. Let D(p) be the 
demand curve for the poor, K the amount demanded by the rich, and S 
the fixed amount supplied in a year with a bad harvest. By assumption the 
supply of grain and the demand by the rich are fixed. Without the charity 
provided by the rich, the equilibrium price is determined by total demand 
equals total supply: 

D(p*) + K = S. 

With the program in place, the equilibrium price is determined by 

D(p/ 2) + K = S. 

But now observe: if p* solves the first equation, then p = 2 p* solves the 
second equation. So when the rich offer to buy the grain and distribute it to 
the poor, the market price is simply bid up to twice the original price—and 
the poor pay the same price they did before! 

When you think about it this isn’t too surprising. If the demand of the 
rich is fixed and the supply of grain is fixed, then the amount that the 
poor can consume is fixed. Thus the equilibrium price facing the poor is 
determined entirely by their own demand curve; the equilibrium price will 
be the same, regardless of how the grain is provided to the poor. 
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EXAMPLE: Subsidies in Iraq 

Even subsidies that are put in place “for a good reason” can be extremely 
difficult to dislodge. Why? Because they create a political constituency 
that comes to rely on them. This is true in every country, but Iraq repre¬ 
sents a particularly egregious case. As of 2005, fuel and food subsidies in 
Iraq consumed nearly one third of the government’s budget. 2 

Almost all of the Iraqi government’s budget comes from oil exports. 
There is very little refining capacity in the country, so Iraq imports gasoline 
at 30 to 35 cents a liter, which it then sells to the public at 1.5 cents. 
A substantial amount of this gasoline is sold on the black market and 
smuggled into Turkey, where gas is about one dollar a liter. 

Food and fuel oil are also highly subsidized. Politicians are reluctant to 
remove these subsidies due to the politically unstable environment. When 
similar subsidies were removed in Yemen, there was rioting in the streets, 
with dozens of people dying. A World Bank study concluded that more 
than half of the GDP in Iraq was spent on subsidies. According to the 
finance minister, Ali Abdulameer Allawi, “They’ve reached the point where 
they’ve become insane. They distort the economy in a grotesque way, and 
create the worst incentives you can think of.” 


16.9 Pareto Efficiency 


An economic situation is Pareto efficient if there is no way to make 
any person better off without hurting anybody else. Pareto efficiency is a 
desirable thing—if there is some way to make some group of people better 
off, why not do it?- but efficiency is not the only goal of economic policy. 
For example, efficiency has almost nothing to say about income distribution 
or economic justice. 

However, efficiency is an important goal, and it is worth asking how well 
a competitive market does in achieving Pareto efficiency. A competitive 
market, or any economic mechanism, has to determine two things. First, 
how much is produced, and second, who gets it. A competitive market 
determines how much is produced based on how much people are willing to 
pay to purchase the good as compared to how much people must be paid 
to supply the good. 

Consider Figure 16.9. At any amount of output less than the competitive 
amount q *, there is someone who is willing to supply an extra unit of the 


2 


James Glanz, “Despite Crushing Costs, Iraqi Cabinet Lets Big Subsidies Stand,” New 
York Times , August 11, 2005. 
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Pareto efficiency. The competitive market determines a 
Pareto efficient amount of output because at q* the price that 
someone is willing to pay to buy an extra unit of the good is 
equal to the price that someone must be paid to sell an extra 
unit of the good. 


Figure 

16.9 


good at a price that is less than the price that someone is willing to pay 
for an extra unit of the good. 

If the good were produced and exchanged between these two people at 
any price between the demand price and the supply price, they would both 
be made better off. Thus any amount less than the equilibrium amount 
cannot be Pareto efficient, since there will be at least two people who could 
be made better off. 

Similarly, at any output larger than q*, the amount someone would be 
willing to pay for an extra unit of the good is less than the price that it 
would take to get it supplied. Only at the market equilibrium q* would we 
have a Pareto efficient amount of output supplied—an amount such that 
the willingness to pay for an extra unit is just equal to the willingness to 
be paid to supply an extra unit. 

Thus the competitive market produces a Pareto efficient amount of out¬ 
put. What about the way in which the good is allocated among the con¬ 
sumers? In a competitive market everyone pays the same price for a good— 
the marginal rate of substitution between the good and “all other goods” 
is equal to the price of the good. Everyone who is willing to pay this price 
is able to purchase the good, and everyone who is not willing to pay this 
price cannot purchase the good. 
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What would happen if there were an allocation of the good where the 
marginal rates of substitution between the good and “all other goods” were 
not the same? Then there must be at least two people who value a marginal 
unit of the good differently. Maybe one values a marginal unit at $5 and 
one values it at $4. Then if the one with the lower value sells a bit of the 
good to the one with the higher value at any price between $4 and $5, 
both people would be made better off. Thus any allocation with different 
marginal rates of substitution cannot be Pareto efficient. 


EXAMPLE: Waiting in Line 

One commonly used way to allocate resources is by making people wait 
in line. We can analyze this mechanism for resource allocation using the 
same tools that we have developed for analyzing the market mechanism. 
Let us look at a concrete example: suppose that your university is going to 
distribute tickets to the championship basketball game. Each person who 
waits in line can get one ticket for free. 

The cost of a ticket will then simply be the cost of waiting in line. People 
who want to see the basketball game very much will camp out outside the 
ticket office so as to be sure to get a ticket. People who don’t care very 
much about the game may drop by a few minutes before the ticket window 
opens on the off chance that some tickets will be left. The willingness to pay 
for a ticket should no longer be measured in dollars but rather in waiting 
time, since tickets will be allocated according to willingness to wait. 

Will waiting in line result in a Pareto efficient allocation of tickets? Ask 
yourself whether it is possible that someone who waited for a ticket might 
be willing to sell it to someone who didn’t wait in line. Often this will be 
the case, simply because willingness to wait and willingness to pay differ 
across the population. If someone is willing to wait in line to buy a ticket 
and then sell it to someone else, allocating tickets by willingness to wait 
does not exhaust all the gains to trade—some people would generally still 
be willing to trade the tickets after the tickets have been allocated. Since 
waiting in line does not exhaust all of the gains from trade, it does not in 
general result in a Pareto efficient outcome. 

If you allocate a good using a price set in dollars, then the dollars paid by 
the demanders provide benefits to the suppliers of the good. If you allocate 
a good using waiting time, the hours spent in line don’t benefit anybody. 
The waiting time imposes a cost on the buyers of the good and provide no 
benefits at all to the suppliers. Waiting in line is a form of deadweight 
loss —the people who wait in line pay a “price” but no one else receives 
any benefits from the price they pay. 
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Summary 


1. The supply curve measures how much people will be willing to supply 
of some good at each price. 

2. An equilibrium price is one where the quantity that people are willing 
to supply equals the quantity that people are willing to demand. 

3. The study of how the equilibrium price and quantity change when the 
underlying demand and supply curves change is another example of com¬ 
parative statics. 

4. When a good is taxed, there will always be two prices: the price paid 
by the demanders and the price received by the suppliers. The difference 
between the two represents the amount of the tax. 

5. How much of a tax gets passed along to consumers depends on the 
relative steepness of the demand and supply curves. If the supply curve 
is horizontal, all of the tax gets passed along to consumers; if the supply 
curve is vertical, none of the tax gets passed along. 

6. The deadweight loss of a tax is the net loss in consumers’ surplus plus 
producers’ surplus that arises from imposing the tax. It measures the value 
of the output that is not sold due to the presence of the tax. 

7. A situation is Pareto efficient if there is no way to make some group of 
people better off without making some other group worse off. 

8. The Pareto efficient amount of output to supply in a single market is 
that amount where the demand and supply curves cross, since this is the 
only point where the amount that demanders are willing to pay for an extra 
unit of output equals the price at which suppliers are willing to supply an 
extra unit of output. 
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1. What is the effect of a subsidy in a market with a horizontal supply 
curve? With a vertical supply curve? 

2. Suppose that the demand curve is vertical while the supply curve slopes 
upward. If a tax is imposed in this market who ends up paying it? 
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3. Suppose that all consumers view red pencils and blue pencils as perfect 
substitutes. Suppose that the supply curve for red pencils is upward slop¬ 
ing. Let the price of red pencils and blue pencils be p r and pb . What would 
happen if the government put a tax only on red pencils? 

4. The United States imports about half of its petroleum needs. Suppose 
that the rest of the oil producers are willing to supply as much oil as the 
United States wants at a constant price of $25 a barrel. What would happen 
to the price of domestic oil if a tax of $5 a barrel were placed on foreign 
oil? 

5. Suppose that the supply curve is vertical. What is the deadweight loss 
of a tax in this market? 

6. Consider the tax treatment of borrowing and lending described in the 
text. How much revenue does this tax system raise if borrowers and lenders 
are in the same tax bracket? 

7. Does such a tax system raise a positive or negative amount of revenue 
when ti < tb! 


CHAPTER 



AUCTIONS 


Auctions are one of the oldest form of markets, dating back to at least 500 
BC. Today, all sorts of commodities, from used computers to fresh flowers, 
are sold using auctions. 

Economists became interested in auctions in the early 1970s when the 
OPEC oil cartel raised the price of oil. The U.S. Department of the Inte¬ 
rior decided to hold auctions to sell the right to drill in coastal areas that 
were expected to contain vast amounts of oil. The government asked econ¬ 
omists how to design these auctions, and private firms hired economists as 
consultants to help them design a bidding strategy. This effort prompted 
considerable research in auction design and strategy. 

More recently, the Federal Communications Commission (FCC) decided 
to auction off parts of the radio spectrum for use by cellular phones, per¬ 
sonal digital assistants, and other communication devices. Again, econ¬ 
omists played a major role in the design of both the auctions and the 
strategies used by the bidders. These auctions were hailed as very suc¬ 
cessful public policy, resulting in revenues to the U.S. government of over 
twenty-three billion dollars to date. 

Other countries have also used auctions for privatization projects. For 
example, Australia sold off several government-owned electricity plants, 
and New Zealand auctioned off parts of its state-owned telephone system. 
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Consumer-oriented auctions have also experienced something of a re¬ 
naissance on the Internet. There are hundreds of auctions on the Internet, 
selling collectibles, computer equipment, travel services, and other items. 
OnSale claims to be the largest, reporting over forty-one million dollars 
worth of merchandise sold in 1997. 


17.1 Classification of Auctions 

The economic classification of auctions involves two considerations: first, 
what is the nature of the good that is being auctioned, and second, what 
are the rules of bidding? With respect to the nature of the good, econo¬ 
mists distinguish between private-value auctions and common-value 
auctions. 

In a private-value auction, each participant has a potentially different 
value for the good in question. A particular piece of art may be worth 
$500 to one collector, $200 to another, and $50 to yet another, depending 
on their taste. In a common-value auction, the good in question is worth 
essentially the same amount to every bidder, although the bidders may 
have different estimates of that common value. The auction for off-shore 
drilling rights described above had this characteristic: a given tract either 
had a certain amount of oil or not. Different oil companies may have had 
different estimates about how much oil was there, based on the outcomes of 
their geological surveys, but the oil had the same market value regardless 
of who won the auction. 

We will spend most of the time in this chapter discussing private-value 
auctions, since they are the most familiar case. At the end of the chapter, 
we will describe some of the features of common-value auctions. 


Bidding Rules 

The most prevalent form of bidding structure for an auction is the English 
auction. The auctioneer starts with a reserve price, which is the lowest 
price at which the seller of the good will part with it. 1 Bidders successively 
offer higher prices; generally each bid must exceed the previous bid by some 
minimal bid increment. When no participant is willing to increase the 
bid further, the item is awarded to the highest bidder. 

Another form of auction is known as a Dutch auction, due to its use 
in the Netherlands for selling cheese and fresh flowers. In this case the 
auctioneer starts with a high price and gradually lowers it by steps until 
someone is willing to buy the item. In practice, the “auctioneer” is often 
a mechanical device like a dial with a pointer which rotates to lower and 

1 See the footnote about “reservation price” in Chapter 6. 
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lower values as the auction progresses. Dutch auctions can proceed very 
rapidly, which is one of their chief virtues. 

Yet a third form of auctions is a sealed-bid auction. In this type of 
auction, each bidder writes down a bid on a slip of paper and seals it in 
an envelope. The envelopes are collected and opened, and the good is 
awarded to the person with the highest bid who then pays the auctioneer 
the amount that he or she bid. If there is a reserve price, and all bids are 
lower than the reserve price, then no one may receive the item. 

Sealed-bid auctions are commonly used for construction work. The per¬ 
son who wants the construction work done requests bids from several con¬ 
tractors with the understanding that the job will be awarded to the con¬ 
tractor with the lowest bid. 

Finally, we consider a variant on the sealed bid-auction that is known as 
the philatelist auction or Vickrey auction. The first name is due to 
the fact that this auction form was originally used by stamp collectors; the 
second name is in honor of William Vickrey, who received the 1996 Nobel 
prize for his pioneering work in analyzing auctions. The Vickrey auction is 
like the sealed-bid auction, with one critical difference: the good is awarded 
to the highest bidder, but at the second-highest price. In other words, the 
person who bids the most gets the good, but he or she only has to pay the 
bid made by the second-highest bidder. Though at first this sounds like a 
rather strange auction form, we will see below that it has some very nice 
properties. 


17.2 Auction Design 

Let us suppose that we have a single item to auction off and that there are 
n bidders with (private) values v %,..., v n . For simplicity, we assume that 
the values are all positive and that the seller has a zero value. Our goal is 
to choose an auction form to sell this item. 

This is a special case of an economic mechanism design problem. In 
the case of the auction there are two natural goals that we might have in 
mind: 


• Pareto efficiency. Design an auction that results in a Pareto efficient 
outcome. 

• Profit maximization. Design an auction that yields the highest ex¬ 
pected profit to the seller. 


Profit maximization seems pretty straightforward, but what does Pareto 
efficiency mean in this context? It is not hard to see that Pareto efficiency 
requires that the good be assigned to the person with the highest value. 
To see this, suppose that person 1 has the highest value and person 2 has 
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some lower value for the good. If person 2 receives the good, then there 
is an easy way to make both 1 and 2 better off: transfer the good from 
person 2 to person 1 and have person 1 pay person 2 some price p that lies 
between Vi and V 2 - This shows that assigning the good to anyone but the 
person who has the highest value cannot be Pareto efficient. 

If the seller knows the values Vi,... ,v n the auction design problem is 
pretty trivial. In the case of profit maximization, the seller should just 
award the item to the person with the highest value and charge him or 
her that value. If the desired goal is Pareto efficiency, the person with the 
highest value should still get the good, but the price paid could be any 
amount between that person’s value and zero, since the distribution of the 
surplus does not matter for Pareto efficiency. 

The more interesting case is when the seller does not know the buyers’ 
values. How can one achieve efficiency or profit maximization in this case? 

First consider Pareto efficiency. It is not hard to see that an English 
auction achieves the desired outcome: the person with the highest value will 
end up with the good. It requires only a little more thought to determine 
the price that this person will pay: it will be the value of the second-highest 
bidder plus, perhaps, the minimal bid increment. 

Think of a specific case where the highest value is, say $100, the second- 
highest value is $80, and the bid increment is, say, $5. Then the person 
with the $100 valuation would be willing to bid $85, while the person with 
the $80 value would not. Just as we claimed, the person with the highest 
valuation gets the good, at the second highest price (plus, perhaps, the bid 
increment). (We keep saying “perhaps” since if both players bid $80 there 
would be a tie and the exact outcome would depend on the rule used for 
tie-breaking.) 

What about profit maximization? This case turns out to be more difficult 
to analyze since it depends on the beliefs that the seller has about the 
buyers’ valuations. To see how this works, suppose that there are just 
two bidders either of whom could have a value of $10 or $100 for the 
item in question. Assume these two cases are equally likely, so that there 
are four equally probable arrangements for the values of bidders 1 and 2: 
(10,10), (10,100), (100,10), (100,100). Finally, suppose that the minimal 
bid increment is $1 and that ties are resolved by flipping a coin. 

In this example, the winning bids in the four cases described above will 
be (10,11,11,100) and the bidder with the highest value will always get the 
good. The expected revenue to the seller is $33 = |(10 + 11 + 11 + 100). 

Can the seller do better than this? Yes, if he sets an appropriate reser¬ 
vation price. In this case, the profit-maximizing reservation price is $100. 
Three-quarters of the time, the seller will sell the item for this price, and 
one-quarter of the time there will be no winning bid. This yields an ex¬ 
pected revenue of $75, much higher than the expected revenue yielded by 
the English auction with no reservation price. 

Note that this policy is not Pareto efficient, since one-quarter of the time 
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no one gets the good. This is analogous to the deadweight loss of monopoly 
and arises for exactly the same reason. 

The addition of the reservation price is very important if you are in¬ 
terested in profit maximization. In 1990, the New Zealand government 
auctioned off some of the spectrum for use by radio, television, and cellu¬ 
lar telephones, using a Vickrey auction. In one case, the winning bid was 
NZ$100,000, but the second-highest bid was only NZ$ 6 ! This auction may 
have led to a Pareto efficient outcome, but it was certainly not revenue 
maximizing! 

We have seen that the English auction with a zero reservation price 
guarantees Pareto efficiency. What about the Dutch auction? The answer 
here is not necessarily. To see this, consider a case with two bidders who 
have values of $100 and $80. If the high-value person believes (erroneously!) 
that the second-highest value is $70, he or she would plan to wait until the 
auctioneer reached, say, $75 before bidding. But, by then, it would be too 
late—the person with the second-highest value would have already bought 
the good at $80. In general, there is no guarantee that the good will be 
awarded to the person with the highest valuation. 

The same holds for the case of a sealed-bid auction. The optimal bid for 
each of the agents depends on their beliefs about the values of the other 
agents. If those beliefs are inaccurate, the good may easily end up being 
awarded to someone who does not have the highest valuation . 2 

Finally, we consider the Vickrey auction—the variant on the sealed-bid 
auction where the highest bidder gets the item, but only has to pay the 
second-highest price. 

First we observe that if everyone bids their true value for the good in 
question, the item will end up being awarded to the person with the highest 
value, who will pay a price equal to that of the person with the second- 
highest value. This is essentially the same as the outcome of the English 
auction (up to the bid increment, which can be arbitrarily small). 

But is it optimal to state your true value in a Vickrey auction? We saw 
that for the standard sealed-bid auction, this is not generally the case. But 
the Vickrey auction is different: the surprising answer is that it is always 
in each player’s interest to write down their true value. 

To see why, let us look at the special case of two bidders, who have 
values Vi and v 2 and write down bids of b\ and 62 ■ The expected payoff to 
bidder 1 is: 

Prob(&i > b 2 ) [vi — 62 ], 


2 On the other hand, if all players’ beliefs are accurate, on average, and all bidders 
play optimally, the various auction forms described above turn out to yield the same 
allocation and the same expected price in equilibrium. For a detailed analysis, see 
P. Milgrom, “Auctions and Bidding: a Primer,” Journal of Economic Perspectives , 
3(3), 1989, 3-22, and P. Klemperer, “Auction Theory: A Guide to the Literature,” 
Economic Surveys , 13(3), 1999, 227-286. 
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where “Prob” stands for “probability.” 

The first term in this expression is the probability that bidder 1 has the 
highest bid; the second term is the consumer surplus that bidder 1 enjoys 
if he wins. (If b\ < 62 , then bidder 1 gets a surplus of 0, so there is no need 
to consider the term containing Prob( 6 i < 62 ).) 

Suppose that Vi > & 2 - Then bidder 1 wants to make the probability of 
winning as large as possible, which he can do by setting bi = v±. Suppose, 
on the other hand, that v\ < 62 - Then bidder 1 wants to make the proba¬ 
bility of winning as small as possible, which he can do by setting b\ = V\. 
In either case, an optimal strategy for bidder 1 is to set his bid equal to his 
true value! Honesty is the best policy ... at least in a Vickrey auction! 

The interesting feature of the Vickrey auction is that it achieves essen¬ 
tially the same outcome as an English auction, but without the iteration. 
This is apparently why it was used by stamp collectors. They sold stamps 
at their conventions using English auctions and via their newsletters using 
sealed-bid auctions. Someone noticed that the sealed-bid auction would 
mimic the outcome of the English auctions if they used the second-highest 
bid rule. But it was left to Vickrey to conduct the full-fledged analysis of 
the philatelist auction and show that truth-telling was the optimal strategy 
and that the philatelist auction was equivalent to the English auction. 


17.3 Other Auction Forms 

The Vickrey auction was thought to be only of limited interest until online 
auctions became popular. The world’s largest online auction house, eBay, 
claims to have almost 30 million registered users who, in 2000, traded $5 
billion worth of merchandise. 

Auctions run by eBay last for several days, or even weeks, and it is 
inconvenient for users to monitor the auction process continually. In or¬ 
der to avoid constant monitoring, eBay introduced an automated bidding 
agent, which they call a proxy bidder. Users tell their bidding agent 
the most they are willing to pay for an item and an initial bid. As the 
bidding progresses, the agent automatically increases a participant’s bid 
by the minimal bid increment when necessary, as long as this doesn’t raise 
the participant’s bid over his or her maximum. 

Essentially this is a Vickrey auction: each user reveals to their bidding 
agent the maximum price he or she is willing to pay. In theory, the par¬ 
ticipant who enters the highest bid will win the item but will only have 
to pay the second-highest bid (plus a minimal bid increment to break the 
tie.) According to the analysis in the text, each bidder has an incentive to 
reveal his or her true value for the item being sold. 

In practice, bidder behavior is a bit different than that predicted by the 
Vickrey model. Often bidders wait until close to the end of the auction to 
enter their bids. This behavior appears to be for two distinct reasons: a 
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reluctance to reveal interest too early in the game, and the hope to snatch 
up a bargain in an auction with few participants. Nevertheless, the bidding 
agent model seems to serve users very well. The Vickrey auction, which 
was once thought to be only of theoretical interest, is now the preferred 
method of bidding for the world’s largest online auction house! 

There are even more exotic auction designs in use. One peculiar example 
is the escalation auction. In this type of auction, the highest bidder wins 
the item, but the highest and the second-highest bidders both have to pay 
the amount they bid. 

Suppose, for example, that you auction off 1 dollar to a number of bidders 
under the escalation auction rules. Typically a few people bid 10 or 15 
cents, but eventually most of the bidders drop out. When the highest bid 
approaches 1 dollar, the remaining bidders begin to catch on to the problem 
they face. If one has bid 90 cents, and the other 85 cents, the low bidder 
realizes that if he stays put, he will pay 85 cents and get nothing but, if he 
escalates to 95 cents, he will walk away with a nickel. 

But once he has done this, the bidder who was at 90 cents can reason the 
same way. In fact, it is in her interest to bid over a dollar. If, for example, 
she bids $1.05 (and wins), she will lose only 5 cents rather than 90 cents! 
It’s not uncommon to see the winning bid end up at $5 or $6. 

A somewhat related auction is the everyone pays auction. Think of 
a crooked politician who announces that he will sell his vote under the 
following conditions: all the lobbyists contribute to his campaign, but he 
will vote for the appropriations favored by the highest contributor. This is 
essentially an auction where everyone pays but only the high bidder gets 
what she wants! 


EXAMPLE: Late Bidding on eBay 

According to standard auction theory eBay’s proxy bidder should induce 
people to bid their true value for an item. The highest bidder wins at 
(essentially) the second highest bid, just as in a Vickrey auction. But it 
doesn’t work quite like that in practice. In many auctions, participants 
wait until virtually the last minute to place their bids. In one study, 37 
percent of the auctions had bids in the last minute and 12 percent had bids 
in the last 10 seconds. Why do we see so many “late bids”? 

There are at least two theories to explain this phenomenon. Patrick 
Bajari and Ali Hortagsu, two auction experts, argue that for certain sorts 
of auctions, people don’t want to bid early to avoid driving up the selling 
price. EBay typically displays the bidder identification and actual bids 
(not the maximum bids) for items being sold. If you are an expert on rare 
stamps, with a well-known eBay member name, you may want to hold back 
placing your bid so as not to reveal that you are interested in a particular 
stamp. 


322 AUCTIONS (Ch. 17) 


This explanation makes a lot of sense for collectibles such as stamps and 
coins, but late bidding also occurs in auctions for generic items, such as 
computer parts. A1 Roth and Axel Ockenfels suggest that late bidding is 
a way to avoiding bidding wars. 

Suppose that you and someone else are bidding for a Pez dispenser with 
a seller’s reserve price of $2. It happens that you each value the dispenser 
at $10. If you both bid early, stating your true maximum value of $10, 
then even if the tie is resolved in your favor you end up paying $10—since 
that is also the other bidder’s maximum value. You may “win” but you 
don’t get any consumer surplus! 

Alternatively, suppose that each of you waits until the auction is almost 
over and then bids $10 in the last possible seconds of the auction. (At 
eBay, this is called “sniping.”) In this case, there’s a good chance that 
one of the bids won’t get through, so the winner ends up paying only the 
seller’s reserve price of $2. 

Bidding high at the last minute introduces some randomness into the 
outcome. One of the players gets a great deal and the other gets nothing. 
But that’s not necessarily so bad: if they both bid early, one of the players 
ends up paying his full value and the other gets nothing. 

In this analysis, the late bidding is a form of “implicit collusion.” By 
waiting to bid, and allowing chance to play a role, bidders can end up doing 
substantially better on average than they do by bidding early. 


17.4 Position Auctions 

A position auction is a way to auction off positions, such as a position 
in a line or a position on a web page. The defining characteristic is that 
all players rank the positions in the same way, but they may value the 
positions differently. Everybody would agree that it is better to be in the 
front of the line than further back, but they could be willing to pay different 
amounts to be first in line. 

One prominent example of a position auction is the auction used by 
search engine providers such as Google, Microsoft, and Yahoo to sell ads. 
In this case all advertisers agree that being in the top position is best, 
the second from the top position is second best, and so on. However, the 
advertisers are often selling different things, so the expected profit that 
they will get from a visitor to their web page will differ. 

Here we describe a simplified version of these online ad auctions. De¬ 
tails differ across search engines, but the model below captures the general 
behavior. 

We suppose that there are s = 1,..., S slots where ads can be displayed. 
Let x s denote the number of clicks that an ad can expect to receive in slot 
s. We assume that slots are ordered with respect to the number of clicks 
they are likely to receive, so Xi > ^ > ' • • > ^s- 
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Each of the advertisers has a value per click, which is related to the 
expected profit it can get from a visitor to its web site. Let v s be the value 
per click of the advertiser whose ad is shown in slot s. 

Each advertiser states a bid, b 3 , which is interpreted as the amount 
it is willing to pay for slot s. The best slot (slot 1) is awarded to the 
advertiser with the highest bid, the second-best slot (slot 2 ) is awarded to 
the advertiser with the second highest bid, and so on. 

The price that an advertiser pays for a bid is determined by the bid of 
the advertiser below him. This is a variation on the Vickrey auction model 
described earlier and is sometimes known as a generalized second price 
auction or GSP. 

In the GSP, advertiser 1 pays b 2 per click, advertiser 2 pays 63 per click, 
and so on. The rationale for this arrangement is that if an advertiser paid 
the price it bid, it would have an incentive to cut its bid until it just beat 
the advertiser below it. By setting the payment of the advertiser in slot s 
to be the bid of the advertiser in slot s + 1 , each advertiser ends up paying 
the minimum bid necessary to retain its position. 

Putting these pieces together, we see that the profit of the advertiser in 
slot s is (v s — b s +i)x a . This is just the value of the clicks minus the cost of 
the clicks that an advertiser receives. 

What is the equilibrium of this auction? Extrapolating from the Vickrey 
auction, one might speculate that each advertiser should bid its true value. 
This is true if there is only one slot being auctioned, but is false in general. 


Two Bidders 

Let us look at the case of 2 slots and 2 bidders. We assume that the high 
bidder gets x\ clicks and pays the bid of the second highest bidder b 2 . The 
second highest bidder gets slot 2 and pays a reserve price r. 

Suppose your value is v and you bid b. If b > b 2 you get a payoff of 
(v — b 2 )xi and if b < b 2 you get a payoff of (v — r)x 2 . Your expected payoff 
is then 


Prob (6 > b 2 )(v — b 2 )xi + [1 — Prob (6 > 62)] (u — r)x 2 - 
We can rearrange your expected payoff to be 

{v - r)x 2 + Prob (6 > b 2 )[v{x\ - x 2 ) + rx 2 - b 2 x\] (17.1) 

Note that when the term in the brackets is positive (i.e., you make a 
profit), you want the probability that b > b 2 to be as large as possible, and 
when the term is negative (you make a loss) you want the probability that 
b > b 2 to be as small as possible. 
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However, this can easily be arranged. Simply choose a bid according to 
this formula: 


bx i = v(xi — x 2 ) + rx 2- 


Now it is easy to check that when b > 62, the bracketed term in expression 
(17.1) is positive and when b < 62 the bracketed term in (17.1) is negative 
or zero. Hence this bid will win the auction exactly when you want to win 
and lose it exactly when you want to lose. 

Note that this bidding rule is a dominant strategy: each bidder wants 
to bid according to this formula, regardless of what the other player bids. 
This means, of course, that the auction ends up putting the bidder with 
the highest value in first place. 

It is also easy to interpret the bid. If there are two bidders and two 
slots, the second highest bidder will always get the second slot and end up 
paying rX2■ The contest is about the extra clicks that the highest bidder 
gets. The bidder who has the highest value will win those clicks, but that 
bidder only has to pay the minimum amount necessary to beat the second 
highest bidder. 

We see that in this auction, you don’t want to bid your true value per 
click, but you do want to bid an amount that reflects your true value of 
the incremental clicks you are getting. 


More Than Two Bidders 

What happens if there are more than two bidders? In this case, there 
will typically not be a dominant strategy equilibrium, but there will be a 
equilibrium in prices. Let us look at a situation with 3 slots and 3 bidders. 

The bidder in slot 3 pays a reservation price r. In equilibrium, the bidder 
won’t want to move up to slot 2 , so 

(v 3 - r)x 3 > ( v 3 - p 2 )x2 


or 

v 3 (x 2 - x 3 ) < p 2 X2 - rx 3 . 

This inequality says that if the bidder prefers position 3 to position 2, the 
value of the extra clicks it gets in position 2 must be less than the cost of 
those extra clicks. 

This inequality gives us a bound on the cost of clicks in position 2: 

P2X2 < rx 3 + v 3 (x 2 - x 3 ). (17-2) 


Applying the same argument to the bidder in position 2, we have 


P1X1 < P2X2 + v 2 (xi - x 2 ). 


( 17 . 3 ) 
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Substituting inequality (17.2) into inequality (17.3) we have 

Pixi < rx 3 + v 3 (x 2 - x 3 ) + v 2 {x\ - x 2 ). (17-4) 

The total revenue in the auction is piX\+p 2 x 2 +p 3 x 3 . Adding inequality 
(17.2) to (17.3) and the revenue for slot 3 we have 

R = v 2 (xi — x 2 ) + 2v 3 (x 2 — x 3 ) + 3rx 3 . 

So far, we have looked at 3 bidders for 3 slots. What happens if there 
are 4 bidders for the 3 slots? In this case the reserve price is replaced by 
the value of the fourth bidder. The logic is that the fourth bidder is willing 
to buy any clicks that exceed its value, just as with the standard Vickrey 
auction. This gives us a revenue expression of 

R! = 3v^x 3 + 2v 3 (x 2 — x 3 ) + v 2 {x\ — x 2 ). 

We note a few things about this expression. First, the competition in the 
search engine auction is about incremental clicks: how many clicks you get 
if you bid for a higher position. Second, the bigger the gap between clicks 
the larger the revenue. Third, when v± > r the revenue will be larger. This 
simply says that competition tends to push revenue up. 


Quality Scores 

In practice, the bids are multiplied by a quality score to get an auction 
ranking score. The ad with the highest bid times quality gets first position, 
the second-highest ranking ad gets the second position, and so on. Each ad 
pays the minimum price per click necessary to retain its position. If we let 
q s be the quality of the ad in slot s, the ads are ordered by b\q\ > b 2 q s > 
b 3 q 3 - ■ ■ and so on. 

The price that the ad in slot 1 pays is just enough to retain its position, 
so piqi = b 2 q 2 , or pi = b 2 q 2 /q\. (There may be some rounding to break 
ties.) 

There are several components of ad quality. However, the major com¬ 
ponent is typically the historical clickthrough rate that an ad gets. This 
means that ad rank is basically determined by 

cost clicks cost 

_x _==_ 

clicks impressions impressions 

Hence the ad that gets first place will be the one that is willing to pay the 
most per impression (i.e., ad view) rather than price per click. 

When you think about it, this makes a lot of sense. Suppose one adver¬ 
tiser is willing to pay $10 per click but is likely to get only 1 click in a day. 
Another advertiser is willing to pay $1 per click will get 100 clicks in a day. 
Which ad should be shown in the most prominent position? 

Ranking ads in this way also helps the users. If two ads have the same 
bid, then the one that users tend to click on more will get a higher position. 
Users can “vote with their clicks” for the ads that they find the most useful. 
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17.5 Problems with Auctions 

We’ve seen above that English auctions (or Vickrey auctions) have the 
desirable property of achieving Pareto efficient outcomes. This makes them 
attractive candidates for resource allocation mechanisms. In fact, most of 
the airwave auctions used by the FCC were variants on the English auction. 

But English auctions are not perfect. They are still susceptible to col¬ 
lusion. The example of pooling in auction markets, described in Chapter 
24, shows how antique dealers in Philadelphia colluded on their bidding 
strategies in auctions. 

There are also various ways to manipulate the outcome of auctions. In 
the analysis described earlier, we assumed that a bid committed the bid¬ 
der to pay. However, some auction designs allow bidders to drop out once 
the winning bids are revealed. Such an option allows for manipulation. 
For example, in 1993 the Australian government auctioned off licenses for 
satellite-television services using a standard sealed-bid auction. The win¬ 
ning bid for one of the licenses, A$212 million, was made by a company 
called Ucom. Once the government announced Ucom had won, they pro¬ 
ceeded to default on their bid, leaving the government to award the license 
to the second-highest bidder—which was also Ucom! They defaulted on 
this bid as well; four months later, after several more defaults, they paid 
A$117 million for the license, which was AS95 million less than their initial 
winning bid! The license ended up being awarded to the highest bidder at 
the second-highest price—but the poorly designed auction caused at least 
a year delay in bringing pay-TV to Australia. 3 


EXAMPLE: Taking Bids Off the Wall 

One common method for manipulating auctions is for the seller to take 
fictitious bids, a practice known as “taking bids off the wall.” Such manip¬ 
ulation has found its way to online auctions as well, even where no walls 
are involved. 

According to a recent news story, 4 a New York jeweler sold large quanti¬ 
ties of diamonds, gold, and platinum jewelry online. Though the items were 
offered on eBay with no reserve price, the seller distributed spreadsheets 
to his employees which instructed them to place bids in order to increase 


3 See John McMillan, “Selling Spectrum Rights,” Journal of Economic Perspectives , 
8(3), 145-152, for details of this story and how its lessons were incorporated into the 
design of the U.S. spectrum auction. This article also describes the New Zealand 
example mentioned earlier. 

4 Barnaby J. Feder, “Jeweler to Pay $400,000 in Online Auction Fraud Settlement,” 
New York Times, June 9, 2007. 
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the final sales price. According to the lawsuit, the employees placed over 
232,000 bids in a one-year period, inflating the selling prices by 20% on 
average. 

When confronted with the evidence, the jeweler agreed to pay a $400,000 
fine to settle the civil fraud complaint. 


17.6 The Winner's Curse 

We turn now to the examination of common-value auctions, where the 
good that is being awarded has the same value to all bidders. However, each 
of the bidders may have different estimates of that value. To emphasize 
this, let us write the (estimated) value of bidder i as v + where v is the 
true, common value and is the “error term” associated with bidder V s 
estimate. 

Let’s examine a sealed-bid auction in this framework. What bid should 
bidder i place? To develop some intuition, let’s see what happens if each 
bidder bids their estimated value. In this case, the person with the highest 
value of e,, e mQX , gets the good. But as long as e max > 0, this person 
is paying more than v, the true value of the good. This is the so-called 
Winner’s Curse. If you win the auction, it is because you have overes¬ 
timated the value of the good being sold. In other words, you have won 
only because you were too optimistic! 

The optimal strategy in a common-value auction like this is to bid less 
than your estimated value—and the more bidders there are, the lower you 
want your own bid to be. Think about it: if you are the highest bidder 
out of five bidders you may be overly optimistic, but if you are the highest 
bidder out of twenty bidders you must be super optimistic. The more 
bidders there are, the more humble you should be about your own estimates 
of the “true value” of the good in question. 

The Winner’s Curse seemed to be operating in the FCC’s May 1996 
spectrum auction for personal communications services. The largest bidder 
in that auction, NextWave Personal Communications Inc., bid $4.2 billion 
for sixty-three licenses, winning them all. However, in January 1998 the 
company filed for Chapter Eleven bankruptcy protection, after finding itself 
unable to pay its bills. 


17.7 Stable Marriage Problem 

There are many examples of two-sided matching models where con¬ 
sumers are matched up with each other. Men may be matched with women 
by a dating service or matchmaker, students may be matched with colleges, 
pledges may be matched with sororities, interns matched with hospitals, 
and so on. 
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What are good algorithms for making such matches? Do “stable” out¬ 
comes always exist? Here we examine a simple mechanism for making 
matches that are stable in a precisely defined sense. 

Let us suppose that there are n men and an equal number of women and 
we need to match them up as dancing partners. Each woman can rank 
the men according to her preferences and the same goes for the men. For 
simplicity, let us suppose that there are no ties in these rankings and that 
everyone would prefer to dance than to sit on the sidelines. 

What is a good way to arrange for dancing partners? One attractive 
criterion is to find a way to produce a “stable” matching. The definition 
of stable, in this context, is that there is no couple that would prefer each 
other to their current partner. Said another way, if a man prefers another 
woman to his current partner, that woman wouldn’t want him—she would 
prefer the partner she currently had. 

Does a stable matching always exist? If so, how can one be found? 
The answer is that, contrary to the impression one would get from soap 
operas and romance novels, there always are stable matchings and they are 
relatively easy to construct. 

The most famous algorithm, known as the deferred acceptance algo¬ 
rithm, goes like this. 5 

Step 1. Each man proposes to his most preferred woman. 

Step 2. Each woman records the list of proposals she receives on her dance 
card. 

Step 3. After all men have proposed to their most-preferred choice, each 
woman (gently) rejects all of the suitors except for her most preferred. 

Step 4. The rejected suitors propose to the next woman on their lists. 

Step 5. Continue to step 2 or terminate the algorithm when every woman 
has received an offer. 


This algorithm always produce a stable matching. Suppose, to the con¬ 
trary, that there is some man that prefers another woman to his present 
partner. Then he would have invited her to dance before his current part¬ 
ner. If she preferred him to her current partner, she would have rejected 
her current partner earlier in the process. 


5 Gale, David, and Lloyd Shapley [1962], “College Admissions and the Stability of 
Marriage,” American Mathematical Monthly, 69, 9-15. 
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It turns out that this algorithm yields the best possible stable matching 
for the men in the sense that each man prefers the outcome of this matching 
process to any other stable matching. Of course, if we flipped the roles of 
men and women, we would find the woman-optimal stable matching. 

Though the example described is slightly frivolous, processes like the 
deferred acceptance algorithm are used to match students to schools in 
Boston and New York, residents to hospitals nationwide, and even organ 
donors to recipients. 


17.8 Mechanism Design 

Auctions and the two-sided matching model that we have discussed in this 
chapter are examples of economic mechanisms. The idea of an economic 
mechanism is to define a “game” or “market” that will yield some desired 
outcome. 

For example, one might want to design a mechanism to sell a painting. 
A natural mechanism here would be an auction. But even with an auction, 
there are many design choices. Should it be designed to maximize efficiency 
(i.e., to ensure that the painting goes to the person who values it most 
highly) or should it be designed to maximize expected revenue for the 
seller, even if there is a risk that the painting may not be sold? 

We’ve seen earlier that there are several different types of auctions, each 
with advantages and disadvantages. Which one is best in a particular 
circumstance? 

Mechanism design is essentially the inverse of game theory. With game 
theory, we are given a description of the rules of the game and want to 
determine what the outcome will be. With mechanism design, we are 
given a description of the outcome that we want to reach and try to design 
a game that will reach it. 6 

Mechanism design is not limited to auctions or matching problems. It 
also includes voting mechanisms and public goods mechanisms, such as 
those described in Chapter 35, or externality mechanisms, such as those 
described in Chapter 33. 

In a general mechanism, we think of a number of agents (i.e., consumers 
or firms) who each have some private information. In the case of an auction, 
this private information might be their value for the item being auctioned. 
In a problem involving firms, the private information might be their cost 
functions. 

The agents report some message about their private information to the 
“center,” which we might think of as an auctioneer. The center examines 
the messages and reports some outcome: who receives the item in question, 


6 The 2007 Nobel Prize in Economics was awarded to Leo Hurwicz, Roger Myerson, 
and Eric Maskin for their contributions to economic mechanism design. 
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what output firms should produce, how much various parties have to pay 
or be paid, and so on. 

The major design decisions are 1) what sort of messages should be sent 
to the center and 2) what rule the center should use to determine the 
outcome. The constraints on the problem are the usual sort of resource 
constraints (i.e., there is only one item to be sold) and the constraints that 
the individuals will act in their own self-interest. This latter constraint is 
known as the incentive compatibility constraint. 

There may be other constraints as well. For example, we may want the 
agents to participate voluntarily in the mechanism, which would require 
that they get at least as high a payoff from participating as not participat¬ 
ing. We will ignore this constraint for simplicity. 

To get a flavor of what mechanism design looks like, let us consider 
a simple problem of awarding an indivisible good to one of two different 
agents. Let (aq, X 2 ) = (1,0) if agent 1 gets the good and (xi,^) = (0,1) 
if agent 2 gets the good. Let p be the price paid for the good. 

We suppose that the message that each agent sends to the center is just 
a reported value for the good. This is known as a direct revelation 
mechanism. The center will then award the good to the agent with the 
highest reported value and charge that agent some price p. 

What are the constraints on pi Suppose agent 1 has the highest value. 
Then his message to the center should be such that the payoff he gets in 
response to that message is at least as large as the payoff he would get if 
he sent the same message as agent 2 (who gets a zero payoff). This says 


v\ — p > 0. 


By the same token, agent 2 must get at least as large a payoff from his 
message as he would get if he sent the message sent by agent 1 (which 
resulted in agent 1 getting the good). This says 


0>v 2 — in¬ 


putting these two conditions together, we have v± > p > V 2 , which says 
that the price charged by the center must lie between the highest and 
second-highest value. 

In order to determine which price the center must charge, we need to 
consider its objects and its information. If the center believes that the v\ 
can be arbitrarily close to V 2 and it always wants to award the item to the 
highest bidder, then it has to set a price of V 2 - 

This is just the Vickrey auction described earlier, in which each party 
submits a bid and the item is awarded to the highest bidder at the second- 
highest bid. This is clearly an attractive mechanism for this particular 
problem. 
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Summary 

1. Auctions have been used for thousands of years to sell things. 

2. If each bidder’s value is independent of the other bidders, the auction 
is said to be a private-value auction. If the value of the item being sold is 
essentially the same for everyone, the auction is said to be a common-value 
auction. 

3. Common auction forms are the English auction, the Dutch auction, the 
sealed-bid auction, and the Vickrey auction. 

4. English auctions and Vickrey auctions have the desirable property that 
their outcomes are Pareto efficient. 

5. Profit-maximizing auctions typically require a strategic choice of the 
reservation price. 

6. Despite their advantages as market mechanisms, auctions are vulnerable 
to collusion and other forms of strategic behavior. 


REVIEW QUESTIONS 


1. Consider an auction of antique quilts to collectors. Is this a private-value 
or a common-value auction? 

2. Suppose that there are only two bidders with values of $8 and $10 for 
an item with a bid increment of $1. What should the reservation price be 
in a profit-maximizing English auction? 

3. Suppose that we have two copies of Intermediate Microeconomics to sell 
to three (enthusiastic) students. How can we use a sealed-bid auction that 
will guarantee that the bidders with the two highest values get the books? 

4. Consider the Ucom example in the text. Was the auction design efficient? 
Did it maximize profits? 

5. A game theorist fills a jar with pennies and auctions it off on the first day 
of class using an English auction. Is this a private-value or a common-value 
auction? Do you think the winning bidder usually makes a profit? 


CHAPTER 



TECHNOLOGY 


In this chapter we begin our study of firm behavior. The first thing to do is 
to examine the constraints on a firm’s behavior. When a firm makes choices 
it faces many constraints. These constraints are imposed by its customers, 
by its competitors, and by nature. In this chapter we’re going to consider 
the latter source of constraints: nature. Nature imposes the constraint that 
there are only certain feasible ways to produce outputs from inputs: there 
are only certain kinds of technological choices that are possible. Here we 
will study how economists describe these technological constraints. 

If you understand consumer theory, production theory will be very easy 
since the same tools are used. In fact, production theory is much simpler 
than consumption theory because the output of a production process is 
generally observable, whereas the “output” of consumption (utility) is not 
directly observable. 


18.1 Inputs and Outputs 

Inputs to production are called factors of production. Factors of produc¬ 
tion are often classified into broad categories such as land, labor, capital, 
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and raw materials. It is pretty apparent what labor, land, and raw mate¬ 
rials mean, but capital may be a new concept. Capital goods are those 
inputs to production that are themselves produced goods. Basically capital 
goods are machines of one sort or another: tractors, buildings, computers, 
or whatever. 

Sometimes capital is used to describe the money used to start up or 
maintain a business. We will always use the term financial capital for 
this concept and use the term capital goods, or physical capital, for 
produced factors of production. 

We will usually want to think of inputs and outputs as being measured 
in flow units: a certain amount of labor per week and a certain number of 
machine hours per week will produce a certain amount of output a week. 

We won’t find it necessary to use the classifications given above very 
often. Most of what we want to describe about technology can be done 
without reference to the kind of inputs and outputs involved- just with 
the amounts of inputs and outputs. 


18.2 Describing Technological Constraints 

Nature imposes technological constraints on firms: only certain combi¬ 
nations of inputs are feasible ways to produce a given amount of output, 
and the firm must limit itself to technologically feasible production plans. 

The easiest way to describe feasible production plans is to list them. 
That is, we can list all combinations of inputs and outputs that are tech¬ 
nologically feasible. The set of all combinations of inputs and outputs that 
comprise a technologically feasible way to produce is called a production 
set. 

Suppose, for example, that we have only one input, measured by x, and 
one output, measured by y. Then a production set might have the shape 
indicated in Figure 18.1. To say that some point (x, y) is in the production 
set is just to say that it is technologically possible to produce y amount 
of output if you have x amount of input. The production set shows the 
possible technological choices facing a firm. 

As long as the inputs to the firm are costly it makes sense to limit our¬ 
selves to examining the maximum possible output for a given level of input. 
This is the boundary of the production set depicted in Figure 18.1. The 
function describing the boundary of this set is known as the production 
function. It measures the maximum possible output that you can get 
from a given amount of input. 

Of course, the concept of a production function applies equally well if 
there are several inputs. If, for example, we consider the case of two inputs, 
the production function f(x 1 , 2 : 2 ) would measure the maximum amount of 
output y that we could get if we had X\ units of factor 1 and X2 units of 
factor 2. 
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y = OUTPUT 


y = f(x) = production function 


Production set 


x = INPUT 


Figure 

18.1 


A production set. Here is a possible shape for a production 
set. 


In the two-input case there is a convenient way to depict production 
relations known as the isoquant. An isoquant is the set of all possible 
combinations of inputs 1 and 2 that are just sufficient to produce a given 
amount of output. 

Isoquants are similar to indifference curves. As we’ve seen earlier, an 
indifference curve depicts the different consumption bundles that are just 
sufficient to produce a certain level of utility. But there is one important 
difference between indifference curves and isoquants. Isoquants are labeled 
with the amount of output they can produce, not with a utility level. Thus 
the labeling of isoquants is fixed by the technology and doesn’t have the 
kind of arbitrary nature that the utility labeling has. 

18.3 Examples of Technology 

Since we already know a lot about indifference curves, it is easy to under¬ 
stand how isoquants work. Let’s consider a few examples of technologies 
and their isoquants. 

Fixed Proportions 

Suppose that we are producing holes and that the only way to get a hole is 
to use one man and one shovel. Extra shovels aren’t worth anything, and 
neither are extra men. Thus the total number of holes that you can produce 
will be the minimum of the number of men and the number of shovels that 
you have. We write the production function as f(x 1 , 2 : 2 ) = min{xi,X 2 }. 
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X 2 


Isoquants 


*1 


Fixed proportions. Isoquants for the case of fixed propor¬ 
tions. 


Figure 

18.2 


The isoquants look like those depicted in Figure 18.2. Note that these 
isoquants are just like the case of perfect complements in consumer theory. 


Perfect Substitutes 

Suppose now that we are producing homework and the inputs are red 
pencils and blue pencils. The amount of homework produced depends only 
on the total number of pencils, so we write the production function as 
f(x i, X 2 ) = X\ + X 2 - The resulting isoquants are just like the case of perfect 
substitutes in consumer theory, as depicted in Figure 18.3. 


Cobb-Douglas 

If the production function has the form f(x 1 ,^ 2 ) = Ax^x^, then we say 
that it is a Cobb-Douglas production function. This is just like the 
functional form for Cobb-Douglas preferences that we studied earlier. The 
numerical magnitude of the utility function was not important, so we set 
A = 1 and usually set a + b = 1. But the magnitude of the production 
function does matter so we have to allow these parameters to take arbitrary 
values. The parameter A measures, roughly speaking, the scale of produc¬ 
tion: how much output we would get if we used one unit of each input. 
The parameters a and b measure how the amount of output responds to 
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Figure 

18.3 



Perfect substitutes. Isoquants for the case of perfect substi¬ 
tutes. 


changes in the inputs. We’ll examine their impact in more detail later on. 
In some of the examples, we will choose to set A = 1 in order to simplify 
the calculations. 

The Cobb-Douglas isoquants have the same nice, well-behaved shape 
that the Cobb-Douglas indifference curves have; as in the case of utility 
functions, the Cobb-Douglas production function is about the simplest ex¬ 
ample of well-behaved isoquants. 


18.4 Properties of Technology 

As in the case of consumers, it is common to assume certain properties 
about technology. First we will generally assume that technologies are 
monotonic: if you increase the amount of at least one of the inputs, it 
should be possible to produce at least as much output as you were pro¬ 
ducing originally. This is sometimes referred to as the property of free 
disposal: if the firm can costlessly dispose of any inputs, having extra 
inputs around can’t hurt it. 

Second, we will often assume that the technology is convex. This means 
that if you have two ways to produce y units of output, (xi, X 2 ) and {z\, Z 2 ), 
then their weighted average will produce at least y units of output. 

One argument for convex technologies goes as follows. Suppose that you 
have a way to produce 1 unit of output using ai units of factor 1 and 02 






PROPERTIES OF TECHNOLOGY 337 


units of factor 2 and that you have another way to produce 1 unit of output 
using b\ units of factor 1 and b 2 units of factor 2. We call these two ways 
to produce output production techniques. 

Furthermore, let us suppose that you are free to scale the output up by 
arbitrary amounts so that (lOOai, 100a2) and (100&i, IOO&2) will produce 
100 units of output. But now note that if you have 25ai + 75&i units of 
factor 1 and 25d2 + 75i>2 units of factor 2 you can still produce 100 units 
of output: just produce 25 units of the output using the “a” technique and 
75 units of the output using the “b” technique. 

This is depicted in Figure 18.4. By choosing the level at which you 
operate each of the two activities, you can produce a given amount of output 
in a variety of different ways. In particular, every input combination along 
the line connecting (100a 1; 100a2) and (IOO61,1006 2 ) will be a feasible way 
to produce 100 units of output. 



Convexity. If you can operate production activities indepen¬ 
dently, then weighted averages of production plans will also be 
feasible. Thus the isoquants will have a convex shape. 


In this kind of technology, where you can scale the production process up 
and down easily and where separate production processes don’t interfere 
with each other, convexity is a very natural assumption. 


Figure 

18.4 
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18.5 The Marginal Product 

Suppose that we are operating at some point, (xi, £ 2 ), and that we consider 
using a little bit more of factor 1 while keeping factor 2 fixed at the level 
X 2 ■ How much more output will we get per additional unit of factor 1? We 
have to look at the change in output per unit change of factor 1: 

A y _ f(x 1 + Axi, x 2 ) - f(x 1 , x 2 ) 

Axi Axi 

We call this the marginal product of factor 1. The marginal product 
of factor 2 is defined in a similar way, and we denote them by MP 1 (x 1 ,x 2 ) 
and MPz(xi,X 2 ), respectively. 

Sometimes we will be a bit sloppy about the concept of marginal product 
and describe it as the extra output we get from having “one” more unit of 
factor 1. As long as “one” is small relative to the total amount of factor 1 
that we are using, this will be satisfactory. But we should remember that 
a marginal product is a rate: the extra amount of output per unit of extra 
input. 

The concept of marginal product is just like the concept of marginal 
utility that we described in our discussion of consumer theory, except for 
the ordinal nature of utility. Here, we are discussing physical output: the 
marginal product of a factor is a specific number, which can, in principle, 
be observed. 


18.6 The Technical Rate of Substitution 


Suppose that we are operating at some point (x\,x 2 ) and that we consider 
giving up a little bit of factor 1 and using just enough more of factor 2 to 
produce the same amount of output y. How much extra of factor 2, Ax 2 , 
do we need if we are going to give up a little bit of factor 1, Axi? This 
is just the slope of the isoquant; we refer to it as the technical rate of 
substitution (TRS), and denote it by TRS(xi, £ 2 ). 

The technical rate of substitution measures the tradeoff between two 
inputs in production. It measures the rate at which the firm will have to 
substitute one input for another in order to keep output constant. 

To derive a formula for the TRS, we can use the same idea that we used 
to determine the slope of the indifference curve. Consider a change in our 
use of factors 1 and 2 that keeps output fixed. Then we have 

Ay = MP\{x\, X 2 ) Ax\ + MP 2 {xi,x 2 )Ax 2 = 0, 


which we can solve to get 


TRS(si,® 2 ) = 


Axn 


MP 1 (x 1> x 2 ) 


Axi MP 2 {x i , x 2 ) 

Note the similarity with the definition of the marginal rate of substitution. 
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18.7 Diminishing Marginal Product 

Suppose that we have certain amounts of factors 1 and 2 and we consider 
adding more of factor 1 while holding factor 2 fixed at a given level. What 
might happen to the marginal product of factor 1? 

As long as we have a monotonic technology, we know that the total 
output will go up as we increase the amount of factor 1. But it is natural 
to expect that it will go up at a decreasing rate. Let’s consider a specific 
example, the case of farming. 

One man on one acre of land might produce 100 bushels of corn. If we 
add another man and keep the same amount of land, we might get 200 
bushels of corn, so in this case the marginal product of an extra worker 
is 100. Now keep adding workers to this acre of land. Each worker may 
produce more output, but eventually the extra amount of corn produced 
by an extra worker will be less than 100 bushels. After 4 or 5 people are 
added the additional output per worker will drop to 90, 80, 70 ... or even 
fewer bushels of corn. If we get hundreds of workers crowded together on 
this one acre of land, an extra worker may even cause output to go down! 
As in the making of broth, extra cooks can make things worse. 

Thus we would typically expect that the marginal product of a factor 
will diminish as we get more and more of that factor. This is called the 
law of diminishing marginal product. It isn’t really a “law”; it’s just 
a common feature of most kinds of production processes. 

It is important to emphasize that the law of diminishing marginal prod¬ 
uct applies only when all other inputs are being held fixed. In the farming 
example, we considered changing only the labor input, holding the land 
and raw materials fixed. 


18.8 Diminishing Technical Rate of Substitution 

Another closely related assumption about technology is that of diminish¬ 
ing technical rate of substitution. This says that as we increase the 
amount of factor 1, and adjust factor 2 so as to stay on the same isoquant, 
the technical rate of substitution declines. Roughly speaking, the assump¬ 
tion of diminishing TRS means that the slope of an isoquant must decrease 
in absolute value as we move along the isoquant in the direction of increas¬ 
ing x\, and it must increase as we move in the direction of increasing xi. 
This means that the isoquants will have the same sort of convex shape that 
well-behaved indifference curves have. 

The assumptions of a diminishing technical rate of substitution and di¬ 
minishing marginal product are closely related but are not exactly the 
same. Diminishing marginal product is an assumption about how the mar¬ 
ginal product changes as we increase the amount of one factor, holding the 
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other factor fixed. Diminishing TRS is about how the ratio of the marginal 
products—the slope of the isoquant—changes as we increase the amount 
of one factor and reduce the amount of the other factor so as to stay on the 
same isoquant. 


18.9 The Long Run and the Short Run 

Let us return now to the original idea of a technology as being just a list 
of the feasible production plans. We may want to distinguish between the 
production plans that are immediately feasible and those that are eventually 
feasible. 

In the short run, there will be some factors of production that are fixed 
at predetermined levels. Our farmer described above might only consider 
production plans that involve a fixed amount of land, if that is all he has 
access to. It may be true that if he had more land, he could produce more 
corn, but in the short run he is stuck with the amount of land that he has. 

On the other hand, in the long run the farmer is free to purchase more 
land, or to sell some of the land he now owns. He can adjust the level of 
the land input so as to maximize his profits. 

The economist’s distinction between the long run and the short run is 
this: in the short run there is at least one factor of production that is fixed: 
a fixed amount of land, a fixed plant size, a fixed number of machines, or 
whatever. In the long run, all the factors of production can be varied. 

There is no specific time interval implied here. What is the long run and 
what is the short run depends on what kinds of choices we are examining. 
In the short run at least some factors are fixed at given levels, but in the 
long run the amount used of these factors can be changed. 

Let’s suppose that factor 2, say, is fixed at 22 in the short run. Then the 
relevant production function for the short run is f[x 1 , 22 )- We can plot the 
functional relation between output and 21 in a diagram like Figure 18.5. 

Note that we have drawn the short-run production function as getting 
flatter and flatter as the amount of factor 1 increases. This is just the law 
of diminishing marginal product in action again. Of course, it can easily 
happen that there is an initial region of increasing marginal returns where 
the marginal product of factor 1 increases as we add more of it. In the case 
of the farmer adding labor, it might be that the first few workers added 
increase output more and more because they would be able to divide up 
jobs efficiently, and so on. But given the fixed amount of land, eventually 
the marginal product of labor will decline. 


18.10 Returns to Scale 

Now let’s consider a different kind of experiment. Instead of increasing the 
amount of one input while holding the other input fixed, let’s increase the 
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Production function. This is a possible shape for a short-run 
production function. 


amount of all inputs to the production function. In other words, let’s scale 
the amount of all inputs up by some constant factor: for example, use twice 
as much of both factor 1 and factor 2. 

If we use twice as much of each input, how much output will we get? 
The most likely outcome is that we will get twice as much output. This is 
called the case of constant returns to scale. In terms of the production 
function, this means that two times as much of each input gives two times as 
much output. In the case of two inputs we can express this mathematically 
by 

2/(xi,x 2 ) = /( 2xi,2x 2 ). 

In general, if we scale all of the inputs up by some amount f, constant 
returns to scale implies that we should get t times as much output: 

tf(x i,x 2 ) = /(fxi,fx 2 ). 

We say that this is the likely outcome for the following reason: it should 
typically be possible for the firm to replicate what it was doing before. If 
the firm has twice as much of each input, it can just set up two plants side 
by side and thereby get twice as much output. With three times as much 
of each input, it can set up three plants, and so on. 

Note that it is perfectly possible for a technology to exhibit constant re¬ 
turns to scale and diminishing marginal product to each factor. Returns 
to scale describes what happens when you increase all inputs, while di¬ 
minishing marginal product describes what happens when you increase one 
of the inputs and hold the others fixed. 


Figure 

18.5 
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Constant returns to scale is the most “natural” case because of the repli¬ 
cation argument, but that isn’t to say that other things might not happen. 
For example, it could happen that if we scale up both inputs by some fac¬ 
tor t, we get more than t times as much output. This is called the case of 
increasing returns to scale. Mathematically, increasing returns to scale 
means that 

f(txi,tx 2 ) > tf{xi,x 2 ). 


for all t > 1. 

What would be an example of a technology that had increasing returns 
to scale? One nice example is that of an oil pipeline. If we double the 
diameter of a pipe, we use twice as much materials, but the cross section 
of the pipe goes up by a factor of 4. Thus we will likely be able to pump 
more than twice as much oil through it. 

(Of course, we can’t push this example too far. If we keep doubling the 
diameter of the pipe, it will eventually collapse of its own weight. Increasing 
returns to scale usually just applies over some range of output.) 

The other case to consider is that of decreasing returns to scale, 
where 

f(tx 1: tx 2 ) < tf(xi,x 2 ) 


for all t > 1. 

This case is somewhat peculiar. If we get less than twice as much output 
from having twice as much of each input, we must be doing something 
wrong. After all, we could just replicate what we were doing before! 

The usual way in which diminishing returns to scale arises is because we 
forgot to account for some input. If we have twice as much of every input 
but one, we won’t be able to exactly replicate what we were doing before, so 
there is no reason that we have to get twice as much output. Diminishing 
returns to scale is really a short-run phenomenon, with something being 
held fixed. 

Of course, a technology can exhibit different kinds of returns to scale 
at different levels of production. It may well happen that for low levels 
of production, the technology exhibits increasing returns to scale—as you 
scale all the inputs by some small amount t , the output increases by more 
than t. Later on, for larger levels of output, increasing scale by t may just 
increase output by the same factor t. 


EXAMPLE: Datacenters 

Datacenters are large buildings that house thousands of computers used 
to perform tasks such as serving web pages. Internet companies such as 
Google, Yahoo, Microsoft, Amazon, and many others have built thousands 
of datacenters around the world. 
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A typical datacenter consists of hundreds of racks which hold computer 
motherboards that are similar to the motherboard in your desktop com¬ 
puter. Generally these systems are designed to be easily scalable so that 
the computational power of the data center can scale up or down just by 
adding or removing racks of computers. 

The replication argument implies that the production function for com¬ 
puting services is effectively constant returns to scale: to double output, 
you simply double all inputs. 


EXAMPLE: Copy Exactly! 

Intel operates dozens of “fab plants” that fabricate, assemble, sort, and test 
advanced computer chips. Chip fabrication is such a delicate process that 
Intel found it difficult to manage quality in a heterogeneous environment. 
Even minor variations in plant design, such as cleaning procedures or the 
length of cooling hoses, could have a large impact on the yield of the fab 
process. 

In order to manage these very subtle effects, Intel moved to its Copy Ex¬ 
actly! process. According to Intel, the Copy Exactly directive is: “... everything 
which might affect the process, or how it is run, is to be copied down to 
the finest detail, unless it is either physically impossible to do so, or there 
is an overwhelming competitive benefit to introducing a change.” 

This means that one Intel plant is very much like another, and deliber¬ 
ately so. As the replication argument suggests, the easiest way to scale up 
production at Intel is to replicate current operating procedures as closely 
as possible. 


Summary 

1. The technological constraints of the firm are described by the production 
set, which depicts all the technologically feasible combinations of inputs 
and outputs, and by the production function, which gives the maximum 
amount of output associated with a given amount of the inputs. 

2. Another way to describe the technological constraints facing a firm is 
through the use of isoquants—curves that indicate all the combinations of 
inputs capable of producing a given level of output. 

3. We generally assume that isoquants are convex and monotonic, just like 
well-behaved preferences. 

4. The marginal product measures the extra output per extra unit of an 
input, holding all other inputs fixed. We typically assume that the marginal 
product of an input diminishes as we use more and more of that input. 
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5. The technical rate of substitution (TRS) measures the slope of an iso¬ 
quant. We generally assume that the TRS diminishes as we move out along 
an isoquant—which is another way of saying that the isoquant has a convex 
shape. 

6. In the short run some inputs are fixed, while in the long run all inputs 
are variable. 

7. Returns to scale refers to the way that output changes as we change 
the scale of production. If we scale all inputs up by some amount t and 
output goes up by the same factor, then we have constant returns to scale. 
If output scales up by more that t , we have increasing returns to scale; and 
if it scales up by less than t , we have decreasing returns to scale. 


REVIEW QUESTIONS 


1. Consider the production function f(x 1 , 2 : 2 ) = x\x\. Does this exhibit 
constant, increasing, or decreasing returns to scale? 

1 1 

2. Consider the production function f(x 1 , 2 : 2 ) = 4x^X2 ■ Does this exhibit 
constant, increasing, or decreasing returns to scale? 

3. The Cobb-Douglas production function is given by f(x 1 , 212 ) = Axfx\- 
It turns out that the type of returns to scale of this function will depend 
on the magnitude of a + b. Which values of a + b will be associated with 
the different kinds of returns to scale? 

4. The technical rate of substitution between factors *2 and x\ is —4. If 
you desire to produce the same amount of output but cut your use of x\ 
by 3 units, how many more units of X 2 will you need? 

5. True or false? If the law of diminishing marginal product did not hold, 
the world’s food supply could be grown in a flowerpot. 

6. In a production process is it possible to have decreasing marginal product 
in an input and yet increasing returns to scale? 


CHAPTER 



PROFIT 

MAXIMIZATION 


In the last chapter we discussed ways to describe the technological choices 
facing the firm. In this chapter we describe a model of how the firm chooses 
the amount to produce and the method of production to employ. The 
model we will use is the model of profit maximization: the firm chooses a 
production plan so as to maximize its profits. 

In this chapter we will assume that the firm faces fixed prices for its in¬ 
puts and outputs. We said earlier that economists call a market where the 
individual producers take the prices as outside their control a competitive 
market. So in this chapter we want to study the profit-maximization prob¬ 
lem of a firm that faces competitive markets for the factors of production 
it uses and the output goods it produces. 


19.1 Profits 

Profits are defined as revenues minus cost. Suppose that the firm produces 
n outputs (j/i,..., y n ) and uses m inputs (iri,..., x m ). Let the prices of the 
output goods be (pi,... ,p n ) and the prices of the inputs be (wi ,..., w m ). 
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The profits the firm receives, 7 r, can be expressed as 


n m 

7T = 'YhviVi - y^/WjXj. 

2=1 2=1 

The first term is revenue, and the second term is cost. 

In the expression for cost we should be sure to include all of the factors 
of production used by the firm, valued at their market price. Usually this 
is pretty obvious, but in cases where the firm is owned and operated by the 
same individual, it is possible to forget about some of the factors. 

For example, if an individual works in his own firm, then his labor is an 
input and it should be counted as part of the costs. His wage rate is simply 
the market price of his labor —what he would be getting if he sold his labor 
on the open market. Similarly, if a farmer owns some land and uses it in 
his production, that land should be valued at its market value for purposes 
of computing the economic costs. 

We have seen that economic costs like these are often referred to as op¬ 
portunity costs. The name comes from the idea that if you are using 
your labor, for example, in one application, you forgo the opportunity of 
employing it elsewhere. Therefore those lost wages are part of the cost of 
production. Similarly with the land example: the farmer has the oppor¬ 
tunity of renting his land to someone else, but he chooses to forgo that 
rental income in favor of renting it to himself. The lost rents are part of 
the opportunity cost of his production. 

The economic definition of profit requires that we value all inputs and 
outputs at their opportunity cost. Profits as determined by accountants do 
not necessarily accurately measure economic profits, as they typically use 
historical costs—what a factor was purchased for originally— rather than 
economic costs—what a factor would cost if purchased now. There are 
many variations on the use of the term “profit,” but we will always stick 
to the economic definition. 

Another confusion that sometimes arises is due to getting time scales 
mixed up. We usually think of the factor inputs as being measured in 
terms of flows. So many labor hours per week and so many machine hours 
per week will produce so much output per week. Then the factor prices will 
be measured in units appropriate for the purchase of such flows. Wages are 
naturally expressed in terms of dollars per hour. The analog for machines 
would be the rental rate —the rate at which you can rent a machine for 
the given time period. 

In many cases there isn’t a very well-developed market for the rental of 
machines, since firms will typically buy their capital equipment. In this 
case, we have to compute the implicit rental rate by seeing how much it 
would cost to buy a machine at the beginning of the period and sell it at 
the end of the period. 
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19.2 The Organization of Firms 

In a capitalist economy, firms are owned by individuals. Firms are only 
legal entities; ultimately it is the owners of firms who are responsible for 
the behavior of the firm, and it is the owners who reap the rewards or pay 
the costs of that behavior. 

Generally speaking, firms can be organized as proprietorships, partner¬ 
ships, or corporations. A proprietorship is a firm that is owned by a 
single individual. A partnership is owned by two or more individuals. A 
corporation is usually owned by several individuals as well, but under the 
law has an existence separate from that of its owners. Thus a partnership 
will last only as long as both partners are alive and agree to maintain its 
existence. A corporation can last longer than the lifetimes of any of its 
owners. For this reason, most large firms are organized as corporations. 

The owners of each of these different types of firms may have different 
goals with respect to managing the operation of the firm. In a proprietor¬ 
ship or a partnership the owners of the firm usually take a direct role in 
actually managing the day-to-day operations of the firm, so they are in a 
position to carry out whatever objectives they have in operating the firm. 
Typically, the owners would be interested in maximizing the profits of their 
firm, but, if they have nonprofit goals, they can certainly indulge in these 
goals instead. 

In a corporation, the owners of the corporation are often distinct from 
the managers of the corporation. Thus there is a separation of ownership 
and control. The owners of the corporation must define an objective for 
the managers to follow in their running of the firm, and then do their 
best to see that they actually pursue the goals the owners have in mind. 
Again, profit maximization is a common goal. As we’ll see below, this goal, 
properly interpreted, is likely to lead the managers of the firm to choose 
actions that are in the interests of the owners of the firm. 


19.3 Profits and Stock Market Value 

Often the production process that a firm uses goes on for many periods. 
Inputs put in place at time t pay off with a whole flow of services at later 
times. For example, a factory building erected by a firm could last for 50 
or 100 years. In this case an input at one point in time helps to produce 
output at other times in the future. 

In this case we have to value a flow of costs and a flow of revenues over 
time. As we’ve seen in Chapter 10, the appropriate way to do this is to 
use the concept of present value. When people can borrow and lend in 
financial markets, the interest rate can be used to define a natural price 
of consumption at different times. Firms have access to the same sorts of 
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financial markets, and the interest rate can be used to value investment 
decisions in exactly the same way. 

Consider a world of perfect certainty where a firm’s flow of future profits 
is publicly known. Then the present value of those profits would be the 
present value of the firm. It would be how much someone would be 
willing to pay to purchase the firm. 

As we indicated above, most large firms are organized as corporations, 
which means that they are jointly owned by a number of individuals. The 
corporation issues stock certificates to represent ownership of shares in the 
corporation. At certain times the corporation issues dividends on these 
shares, which represent a share of the profits of the firm. The shares of 
ownership in the corporation are bought and sold in the stock market. 
The price of a share represents the present value of the stream of dividends 
that people expect to receive from the corporation. The total stock market 
value of a firm represents the present value of the stream of profits that the 
firm is expected to generate. Thus the objective of the firm—maximizing 
the present value of the stream of profits the firm generates—could also 
be described as the goal of maximizing stock market value. In a world of 
certainty, these two goals are the same thing. 

The owners of the firm will generally want the firm to choose production 
plans that maximize the stock market value of the firm, since that will make 
the value of the shares they hold as large as possible. We saw in Chapter 
10 that whatever an individual’s tastes for consumption at different times, 
he or she will always prefer an endowment with a higher present value to 
one with a lower present value. By maximizing stock market value, a firm 
makes its shareholders’ budget sets as large as possible, and thereby acts 
in the best interests of all of its shareholders. 

If there is uncertainty about a firm’s stream of profits, then instructing 
managers to maximize profits has no meaning. Should they maximize ex¬ 
pected profits? Should they maximize the expected utility of profits? What 
attitude toward risky investments should the managers have? It is diffi¬ 
cult to assign a meaning to profit maximization when there is uncertainty 
present. However, in a world of uncertainty, maximizing stock market value 
still has meaning. If the managers of a firm attempt to make the value of 
the firm’s shares as large as possible then they make the firm’s owners—the 
shareholders—as well-off as possible. Thus maximizing stock market value 
gives a well-defined objective function to the firm in nearly all economic 
environments. 

Despite these remarks about time and uncertainty, we will generally limit 
ourselves to the examination of much simpler profit-maximization prob¬ 
lems, namely, those in which there is a single, certain output and a single 
period of time. This simple story still generates significant insights and 
builds the proper intuition to study more general models of firm behavior. 
Most of the ideas that we will examine carry over in a natural way to these 
more general models. 
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19.4 The Boundaries of the Firm 

One question that constantly confronts managers of firms is whether to 
“make or buy.” That is, should a firm make something internally or buy it 
from an external supplier? The question is broader than it sounds, as it can 
refer not only to physical goods, but also services of one sort or another. 
Indeed, in the broadest interpretation, “make or buy” applies to almost 
every decision a firm makes. 

Should a company provide its own cafeteria? Janitorial services? Pho¬ 
tocopying services? Travel assistance? Obviously, many factors enter into 
such decisions. One important consideration is size. A small mom-and-pop 
video store with 12 employees is probably not going to provide a cafeteria. 
But it might outsource janitorial services, depending on cost, capabilities, 
and staffing. 

Even a large organization, which could easily afford to operate food ser¬ 
vices, may or may not choose to do so, depending on availability of alter¬ 
natives. Employees of an organization located in a big city have access to 
many places to eat; if the organization is located in a remote area, choices 
may be fewer. 

One critical issue is whether the goods or services in question are exter¬ 
nally provided by a monopoly or by a competitive market. By and large, 
managers prefer to buy goods and services on a competitive market, if they 
are available. The second-best choice is dealing with an internal monop¬ 
olist. The worse choice of all, in terms of price and quality of service, is 
dealing with an external monopolist. 

Think about photocopying services. The ideal situation is to have dozens 
of competitive providers vying for your business; that way you will get 
cheap prices and high-quality service. If your school is large, or in an urban 
area, there may be many photocopying services vying for your business. On 
the other hand, small rural schools may have less choice and often higher 
prices. 

The same is true of businesses. A highly competitive environment gives 
lots of choices to users. By comparison, an internal photocopying division 
may be less attractive. Even if prices are low, the service could be sluggish. 
But the least attractive option is surely to have to submit to a single 
external provider. An internal monopoly provider may have bad service, 
but at least the money stays inside the firm. 

As technology changes, what is typically inside the firm changes. Forty 
years ago, firms managed many services themselves. Now they tend to 
outsource as much as possible. Food service, photocopying service, and 
janitorial services are often provided by external organizations that spe¬ 
cialize in such activities. Such specialization often allows these companies 
to provide higher quality and less expensive services to the organizations 
that use their services. 
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19.5 Fixed and Variable Factors 

In a given time period, it may be very difficult to adjust some of the inputs. 
Typically a firm may have contractual obligations to employ certain inputs 
at certain levels. An example of this would be a lease on a building, where 
the firm is legally obligated to purchase a certain amount of space over the 
period under examination. We refer to a factor of production that is in 
a fixed amount for the firm as a fixed factor. If a factor can be used in 
different amounts, we refer to it as a variable factor. 

As we saw in Chapter 18 , the short run is defined as that period of time 
in which there are some fixed factors—factors that can only be used in 
fixed amounts. In the long run, on the other hand, the firm is free to vary 
all of the factors of production: all factors are variable factors. 

There is no rigid boundary between the short run and the long run. The 
exact time period involved depends on the problem under examination. 
The important thing is that some of the factors of production are fixed in 
the short run and variable in the long run. Since all factors are variable in 
the long run, a firm is always free to decide to use zero inputs and produce 
zero output-- that is, to go out of business. Thus the least profits a firm 
can make in the long run are zero profits. 

In the short run, the firm is obligated to employ some factors, even if it 
decides to produce zero output. Therefore it is perfectly possible that the 
firm could make negative profits in the short run. 

By definition, fixed factors are factors of production that must be paid 
for even if the firm decides to produce zero output: if a firm has a long¬ 
term lease on a building, it must make its lease payments each period 
whether or not it decides to produce anything that period. But there is 
another category of factors that only need to be paid for if the firm decides 
to produce a positive amount of output. One example is electricity used 
for lighting. If the firm produces zero output, it doesn’t have to provide 
any lighting; but if it produces any positive amount of output, it has to 
purchase a fixed amount of electricity to use for lighting. 

Factors such as these are called quasi-fixed factors. They are factors of 
production that must be used in a fixed amount, independent of the output 
of the firm, as long as the output is positive. The distinction between 
fixed factors and quasi-fixed factors is sometimes useful in analyzing the 
economic behavior of the firm. 


19.6 Short-Run Profit Maximization 

Let’s consider the short-run profit-maximization problem when input 2 is 
fixed at some level X2- Let f(x 1,3:2) be the production function for the 
firm, let p be the price of output, and let w\ and W2 be the prices of the 
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two inputs. Then the profit-maximization problem facing the firm can be 
written as 

max pfix 1 , 0 : 2 ) — w\X\ — W 2 X 2 - 

Xl 

The condition for the optimal choice of factor 1 is not difficult to determine. 

If x\ is the profit-maximizing choice of factor 1, then the output price 
times the marginal product of factor 1 should equal the price of factor 1. 
In symbols, 

pMP 1 (x*,x 2 ) = uq. 

In other words, the value of the marginal product of a factor should equal 
its price. 

In order to understand this rule, think about the decision to employ a 
little more of factor 1. As you add a little more of it, Aaq, you produce 
Ay = MPiAxi more output that is worth pMPiAx\. But this marginal 
output costs wiAi’i to produce. If the value of marginal product exceeds 
its cost, then profits can be increased by increasing input 1. If the value 
of marginal product is less than its cost, then profits can be increased by 
decreasing the level of input 1. 

If the profits of the firm are as large as possible, then profits should 
not increase when we increase or decrease input 1. This means that at a 
profit-maximizing choice of inputs and outputs, the value of the marginal 
product, pMPi(xi,X 2 ), should equal the factor price, w±. 

We can derive the same condition graphically. Consider Figure 19.1. The 
curved line represents the production function holding factor 2 fixed at X 2 - 
Using y to denote the output of the firm, profits are given by 


n = py — w±Xi — w 2 X 2 - 


This expression can be solved for y to express output as a function of X \: 


7T W 2 _ W 1 

y = - + — x 2 h - x 1 . 

p p p 


(19.1) 


This equation describes isoprofit lines. These are just all combinations 
of the input goods and the output good that give a constant level of profit, 
7T. As 7T varies we get a family of parallel straight lines each with a slope of 
Wi/p and each having a vertical intercept of ir/p+W 2 X 2 /p , which measures 
the profits plus the fixed costs of the firm. 

The fixed costs are fixed, so the only thing that really varies as we move 
from one isoprofit line to another is the level of profits. Thus higher levels of 
profit will be associated with isoprofit lines with higher vertical intercepts. 

The profit-maximization problem is then to find the point on the produc¬ 
tion function that has the highest associated isoprofit line. Such a point 
is illustrated in Figure 19.1. As usual it is characterized by a tangency 
condition: the slope of the production function should equal the slope of 
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Figure 

19.1 



Profit maximization. The firm chooses the input and output 
combination that lies on the highest isoprofit line. In this case 
the profit-maximizing point is ( x\,y *). 


the isoprofit line. Since the slope of the production function is the marginal 
product, and the slope of the isoprofit line is w\/p : this condition can also 
be written as 

MP 1 = 

P 

which is equivalent to the condition we derived above. 


19.7 Comparative Statics 

We can use the geometry depicted in Figure 19.1 to analyze how a firm’s 
choice of inputs and outputs varies as the prices of inputs and outputs 
vary. This gives us one way to analyze the comparative statics of firm 
behavior. 

For example: how does the optimal choice of factor 1 vary as we vary its 
factor price w\l Referring to equation (19.1), which defines the isoprofit 
line, we see that increasing itq will make the isoprofit line steeper, as shown 
in Figure 19.2A. When the isoprofit line is steeper, the tangency must occur 
further to the left. Thus the optimal level of factor 1 must decrease. This 
simply means that as the price of factor 1 increases, the demand for factor 1 
must decrease: factor demand curves must slope downward. 

Similarly, if the output price decreases the isoprofit line must become 
steeper, as shown in Figure 19.2B. By the same argument as given in the 
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Comparative statics. Panel A shows that increasing wi will 
reduce the demand for factor 1. Panel B shows that increasing 
the price of output will increase the demand for factor 1 and 
therefore increase the supply of output. 


Figure 

19.2 


last paragraph the profit-maximizing choice of factor 1 will decrease. If the 
amount of factor 1 decreases and the level of factor 2 is fixed in the short 
run by assumption, then the supply of output must decrease. This gives us 
another comparative statics result: a reduction in the output price must 
decrease the supply of output. In other words, the supply function must 
slope upwards. 

Finally, we can ask what will happen if the price of factor 2 changes? 
Because this is a short-run analysis, changing the price of factor 2 will not 
change the firm’s choice of factor 2—in the short run, the level of factor 2 
is fixed at X 2 ■ Changing the price of factor 2 has no effect on the slope of 
the isoprofit line. Thus the optimal choice of factor 1 will not change, nor 
will the supply of output. All that changes are the profits that the firm 
makes. 


19.8 Profit Maximization in the Long Run 

In the long run the firm is free to choose the level of all inputs. Thus the 
long-run profit-maximization problem can be posed as 

max pf(x 1 , 2 : 2 ) — W 1 X 1 — ^ 2 ^ 2 - 

Xi,X2 


This is basically the same as the short-run problem described above, but 
now both factors are free to vary. 
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The condition describing the optimal choices is essentially the same as 
before, but now we have to apply it to each factor. Before we saw that 
the value of the marginal product of factor 1 must be equal to its price, 
whatever the level of factor 2. The same sort of condition must now hold 
for each factor choice: 

pMP\(x\,x 2 ) = Wl 
pMP 2 (x\,x 2 ) = w 2 . 

If the firm has made the optimal choices of factors 1 and 2, the value of 
the marginal product of each factor should equal its price. At the optimal 
choice, the firm’s profits cannot increase by changing the level of either 
input. 

The argument is the same as used for the short-run profit-maximizing 
decisions. If the value of the marginal product of factor 1, for example, 
exceeded the price of factor 1, then using a little more of factor 1 would 
produce MP\ more output, which would sell for pMP\ dollars. If the value 
of this output exceeds the cost of the factor used to produce it, it clearly 
pays to expand the use of this factor. 

These two conditions give us two equations in two unknowns, x\ and x 2 . 
If we know how the marginal products behave as a function of X\ and x 2 , 
we will be able to solve for the optimal choice of each factor as a function 
of the prices. The resulting equations are known as the factor demand 
curves. 


19.9 Inverse Factor Demand Curves 

The factor demand curves of a firm measure the relationship between 
the price of a factor and the profit-maximizing choice of that factor. We saw 
above how to find the profit-maximizing choices: for any prices, ( p , w i, w 2 ), 
we just find those factor demands, (x*,:^), such that the value of the 
marginal product of each factor equals its price. 

The inverse factor demand curve measures the same relationship, 
but from a different point of view. It measures what the factor prices must 
be for some given quantity of inputs to be demanded. Given the optimal 
choice of factor 2, we can draw the relationship between the optimal choice 
of factor 1 and its price in a diagram like that depicted in Figure 19.3. This 
is simply a graph of the equation 


pMPx(x i,x 2 ) = Wi. 


This curve will be downward sloping by the assumption of diminishing 
marginal product. For any level of x±, this curve depicts what the factor 
price must be in order to induce the firm to demand that level of X\ , holding 
factor 2 fixed at x 2 . 
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The inverse factor demand curve. This measures what the 
price of factor 1 must be to get x\ units demanded if the level 
of the other factor is held fixed at x 2 . 


19.10 Profit Maximization and Returns to Scale 

There is an important relationship between competitive profit maximiza¬ 
tion and returns to scale. Suppose that a firm has chosen a long-run profit- 
maximizing output y* = f(x\,x 2 ), which it is producing using input levels 

Then its profits are given by 

* * * * 

7 r = py —W\x 1 —W 2 X 2 - 

Suppose that this firm’s production function exhibits constant returns to 
scale and that it is making positive profits in equilibrium. Then consider 
what would happen if it doubled the level of its input usage. According to 
the constant returns to scale hypothesis, it would double its output level. 
What would happen to profits? 

It is not hard to see that its profits would also double. But this con¬ 
tradicts the assumption that its original choice was profit maximizing! We 
derived this contradiction by assuming that the original profit level was 
positive; if the original level of profits were zero there would be no prob¬ 
lem: two times zero is still zero. 

This argument shows that the only reasonable long-run level of profits 
for a competitive firm that has constant returns to scale at all levels of 
output is a zero level of profits. (Of course if a firm has negative profits in 
the long run, it should go out of business.) 


Figure 

19.3 
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Most people find this to be a surprising statement. Firms are out to 
maximize profits aren’t they? How can it be that they can only get zero 
profits in the long run? 

Think about what would happen to a firm that did try to expand indef¬ 
initely. Three things might occur. First, the firm could get so large that it 
could not really operate effectively. This is just saying that the firm really 
doesn’t have constant returns to scale at all levels of output. Eventually, 
due to coordination problems, it might enter a region of decreasing returns 
to scale. 

Second, the firm might get so large that it would totally dominate the 
market for its product. In this case there is no reason for it to behave 
competitively—to take the price of output as given. Instead, it would 
make sense for such a firm to try to use its size to influence the market 
price. The model of competitive profit maximization would no longer be 
a sensible way for the firm to behave, since it would effectively have no 
competitors. We’ll investigate more appropriate models of firm behavior 
in this situation when we discuss monopoly. 

Third, if one firm can make positive profits with a constant returns to 
scale technology, so can any other firm with access to the same technology. 
If one firm wants to expand its output, so would other firms. But if all firms 
expand their outputs, this will certainly push down the price of output and 
lower the profits of all the firms in the industry. 


19.11 Revealed Profitability 

When a profit-maximizing firm makes its choice of inputs and outputs 
it reveals two things: first, that the inputs and outputs used represent a 
feasible production plan, and second, that these choices are more profitable 
than other feasible choices that the firm could have made. Let us examine 
these points in more detail. 

Suppose that we observe two choices that the firm makes at two dif¬ 
ferent sets of prices. At time t, it faces prices (p* 1 ’w\,w t 2 ) and makes 

choices (j/ 4 , x\,x 2 ). At time s, it faces prices ( p s , w*, w |) and makes choices 

(y s ,x\,x 2 ). If the production function of the firm hasn’t changed between 
times s and t and if the firm is a profit maximizer, then we must have 

t t t t t t \ t s t s t s /in o\ 

PV - W\X\ - w 2 x 2 >py- w 1 x 1 - w 2 x 2 (19.2) 

and 

P s y s ~ w{x\ - w 2 x 2 > p 8 y t - w{x{ - w 2 x 2 . (19.3) 

That is, the profits that the firm achieved facing the t period prices must be 
larger than if they used the s period plan and vice versa. If either of these 
inequalities were violated, the firm could not have been a profit-maximizing 
firm (with an unchanging technology). 
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Thus if we ever observe two time periods where these inequalities are 
violated we would know that the firm was not maximizing profits in at least 
one of the two periods. The satisfaction of these inequalities is virtually 
an axiom of profit-maximizing behavior, so it might be referred to as the 
Weak Axiom of Profit Maximization (WAPM). 

If the firm’s choices satisfy WAPM, we can derive a useful comparative 
statics statement about the behavior of factor demands and output supplies 
when prices change. Transpose the two sides of equation (19.3) to get 

—p s y t + w{x\ + w 2 x\ > —p s y s + w{x{ + w 2 x f (19-4) 

and add equation (19.4) to equation (19.2) to get 

(p* - p s )y‘ - (w\ - w{)x\ - (w\ - w s 2 )x\ 

> (p* - P s )y s - (w\ - w{)x{ - (w\ - w s 2 )x s 2 . (19.5) 

Now rearrange this equation to yield 

(P* -P s )(2/‘ - y s ) ~ - w{)(x\ - x?) - (w\ - w s 2 )(xl - x s 2 ) > 0. (19.6) 

Finally define the change in prices, Ap = (p ( — p s ), the change in output, 
Ay = (y* - y s ), and so on to find 

ApAy — AwiAxi — Aw 2 Ax 2 > 0. (19.7) 

This equation is our final result. It says that the change in the price of 
output times the change in output minus the change in each factor price 
times the change in that factor must be nonnegative. This equation comes 
solely from the definition of profit maximization. Yet it contains all of the 
comparative statics results about profit-maximizing choices! 

For example, suppose that we consider a situation where the price of 
output changes, but the price of each factor stays constant. If Awi = 
Aw 2 = 0, then equation (19.7) reduces to 

ApAy > 0. 

Thus if the price of output goes up, so that Ap > 0, then the change in 
output must be nonnegative as well, Ay > 0. This says that the profit- 
maximizing supply curve of a competitive firm must have a positive (or at 
least a zero) slope. 

Similarly, if the price of output and of factor 2 remain constant, equation 
(19.7) becomes 

—Aw\Axi > 0 , 


which is to say 


AuqAxi < 0. 
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Thus if the price of factor 1 goes up, so that Awi > 0, then equation 
(19.7) implies that the demand for factor 1 will go down (or at worst stay 
the same), so that Aaq < 0. This means that the factor demand curve 
must be a decreasing function of the factor price: factor demand curves 
have a negative slope. 

The simple inequality in WAPM, and its implication in equation (19.7), 
give us strong observable restrictions about how a firm will behave. It 
is natural to ask whether these are all of the restrictions that the model 
of profit maximization imposes on firm behavior. Said another way, if we 
observe a firm’s choices, and these choices satisfy WAPM, can we construct 
an estimate of the technology for which the observed choices are profit- 
maximizing choices? It turns out that the answer is yes. Figure 19.4 shows 
how to construct such a technology. 



y 

Isoprofit line 
for period s 

Isoprofit line 
for period f 


n,/p t 

njp s 

(* % 

V *j) 





*1 

Figure 

19.4 

Construction of a possible technology. If the observed 
choices are maximal profit choices at each set of prices, then we 
can estimate the shape of the technology that generated those 
choices by using the isoprofit lines. 


In order to illustrate the argument graphically, we suppose that there 
is one input and one output. Suppose that we are given an observed 
choice in period t and in period s, which we indicate by (p*, w \, t/ 4 , x\) 
and (p s , wf, y s , xf ). In each period we can calculate the profits 7r s and 7r t 
and plot all the combinations of y and x\ that yield these profits. 

That is, we plot the two isoprofit lines 

TTt = v l y - w\xi 
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and 

7T S = V s V - w{xi_. 

The points above the isoprofit line for period t have higher profits than 
7 Tt at period t prices, and the points above the isoprofit line for period s 
have higher profits than 7r s at period s prices. WAPM requires that the 
choice in period t must lie below the period s isoprofit line and that the 
choice in period s must lie below the period t isoprofit line. 

If this condition is satisfied, it is not hard to generate a technology for 
which and (y s ,xl) are profit-maximizing choices. Just take the 

shaded area beneath the two lines. These are all of the choices that yield 
lower profits than the observed choices at both sets of prices. 

The proof that this technology will generate the observed choices as 
profit-maximizing choices is clear geometrically. At the prices (p 4 ,w4), the 
choice {y t ,x\) is on the highest isoprofit line possible, and the same goes 
for the period s choice. 

Thus, when the observed choices satisfy WAPM, we can “reconstruct” 
an estimate of a technology that might have generated the observations. 
In this sense, any observed choices consistent with WAPM could be profit- 
maximizing choices. As we observe more choices that the firm makes, we get 
a tighter estimate of the production function, as illustrated in Figure 19.5. 

This estimate of the production function can be used to forecast firm 
behavior in other environments or for other uses in economic analysis. 



Estimating the technology. As we observe more choices we 
get a tighter estimate of the production function. 


Figure 

19.5 
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EXAMPLE: How Do Farmers React to Price Supports? 

The U.S. government currently spends between $40 and $60 billion a year 
in aid to farmers. A large fraction of this amount is used to subsidize 
the production of various products including milk, wheat, corn, soybeans, 
and cotton. Occasionally, attempts are made to reduce or eliminate these 
subsidies. The effect of elimination of these subsidies would be to reduce 
the price of the product received by the farmers. 

Farmers sometimes argue that eliminating the subsidies to milk, for ex¬ 
ample, would not reduce the total supply of milk, since dairy farmers would 
choose to increase their herds and their supply of milk so as to keep their 
standard of living constant. 

If farmers are behaving so as to maximize profits, this is impossible. As 
we’ve seen above, the logic of profit maximization requires that a decrease 
in the price of an output leads to a reduction in its supply: if A p is negative, 
then A y must be negative as well. 

It is certainly possible that small family farms have goals other than sim¬ 
ple maximization of profits, but larger “agribusiness” farms are more likely 
to be profit maximizers. Thus the perverse response to the elimination of 
subsidies alluded to above could only occur on a limited scale, if at all. 


19.12 Cost Minimization 

If a firm is maximizing profits and if it chooses to supply some output y, 
then it must be minimizing the cost of producing y. If this were not so, then 
there would be some cheaper way of producing y units of output, which 
would mean that the firm was not maximizing profits in the first place. 

This simple observation turns out to be quite useful in examining firm 
behavior. It turns out to be convenient to break the profit-maximization 
problem into two stages: first we figure out how to minimize the costs of 
producing any desired level of output y , then we figure out which level of 
output is indeed a profit-maximizing level of output. We begin this task in 
the next chapter. 


Summary 

1. Profits are the difference between revenues and costs. In this definition 
it is important that all costs be measured using the appropriate market 
prices. 

2. Fixed factors are factors whose amount is independent of the level of 
output; variable factors are factors whose amount used changes as the level 
of output changes. 
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3. In the short run, some factors must be used in predetermined amounts. 
In the long run, all factors are free to vary. 

4. If the firm is maximizing profits, then the value of the marginal product 
of each factor that it is free to vary must equal its factor price. 

5. The logic of profit maximization implies that the supply function of a 
competitive firm must be an increasing function of the price of output and 
that each factor demand function must be a decreasing function of its price. 

6 . If a competitive firm exhibits constant returns to scale, then its long-run 
maximum profits must be zero. 


REVIEW QUESTIONS 


1. In the short run, if the price of the fixed factor is increased, what will 
happen to profits? 

2. If a firm had everywhere increasing returns to scale, what would happen 
to its profits if prices remained fixed and if it doubled its scale of operation? 

3. If a firm had decreasing returns to scale at all levels of output and it 
divided up into two equal-size smaller firms, what would happen to its 
overall profits? 

4. A gardener exclaims: “For only $1 in seeds I’ve grown over $20 in pro¬ 
duce!” Besides the fact that most of the produce is in the form of zucchini, 
what other observations would a cynical economist make about this situa¬ 
tion? 

5. Is maximizing a firm’s profits always identical to maximizing the firm’s 
stock market value? 

6 . If pMP\ > w i, then should the firm increase or decrease the amount of 
factor 1 in order to increase profits? 

7. Suppose a firm is maximizing profits in the short run with variable factor 
x\ and fixed factor X 2 - If the price of X 2 goes down, what happens to the 
firm’s use of aq? What happens to the firm’s level of profits? 

8 . A profit-maximizing competitive firm that is making positive profits 
in long-run equilibrium (may/may not) have a technology with constant 
returns to scale. 
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APPENDIX 

The profit-maximization problem of the firm is 

maxp/(ii,X2) — will — W2X2, 


which has first-order conditions 


df(xt,x* 2 ) 
^ dxi 

df{xl,x*2) 
P 8x2 


— wi = 0 

— W 2 = 0 . 


These are just the same as the marginal product conditions given in the text. 
Let’s see how profit-maximizing behavior looks using the Cobb-Douglas produc¬ 
tion function. 

Suppose the Cobb-Douglas function is given by /(£1,3:2) = x^x^- Then the 
two first-order conditions become 

a— 1 b 

pax ± X 2 ~ Wl = 0 
pbx 1X2 1 — w 2 = 0 . 


Multiply the first equation by x\ and the second equation by X2 to get 


pax\x 2 — W1X1 = 0 

pbxlx 2 — W2X2 = 0 . 


Using y = X1X2 to denote the level of output of this firm we can rewrite these 
expressions as 

pay — W1X1 
pby = ui2 ab¬ 


solving for xi and X2 we have 


Xl 


x 2 


gpy 

Wl 

bpy 

W 2 


This gives us the demands for the two factors as a function of the optimal output 
choice. But we still have to solve for the optimal choice of output. Inserting the 
optimal factor demands into the Cobb-Douglas production function, we have the 
expression 



Factoring out the y gives 
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Or 



This gives us the supply function of the Cobb-Douglas firm. Along with the 
factor demand functions derived above it gives us a complete solution to the 
profit-maximization problem. 

Note that when the firm exhibits constant returns to scale—when a + b = 1 
this supply function is not well defined. As long as the output and input prices are 
consistent with zero profits, a firm with a Cobb-Douglas technology is indifferent 
about its level of supply. 




CHAPTER 



COST 

MINIMIZATION 


Our goal is to study the behavior of profit-maximizing firms in both com¬ 
petitive and noncompetitive market environments. In the last chapter we 
began our investigation of profit-maximizing behavior in a competitive en¬ 
vironment by examining the profit-maximization problem directly. 

However, some important insights can be gained through a more indirect 
approach. Our strategy will be to break up the profit-maximization prob¬ 
lem into two pieces. First, we will look at the problem of how to minimize 
the costs of producing any given level of output, and then we will look at 
how to choose the most profitable level of output. In this chapter we’ll look 
at the first step—minimizing the costs of producing a given level of output. 


20.1 Cost Minimization 

Suppose that we have two factors of production that have prices W\ and 
W 2 , and that we want to figure out the cheapest way to produce a given 
level of output, y. If we let x\ and X 2 measure the amounts used of the 
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two factors and let f(x 1 , 2 : 2 ) be the production function for the firm, we 
can write this problem as 


min W 1 X 1 + W 2 X 2 

Xl ,X2 


such that f(x 1 , 2 : 2 ) = y. 

The same warnings apply as in the preceding chapter concerning this sort 
of analysis: make sure that you have included all costs of production in 
the calculation of costs, and make sure that everything is being measured 
on a compatible time scale. 

The solution to this cost-minimization problem—the minimum costs nec¬ 
essary to achieve the desired level of output—will depend on w 1 , w 2l and y, 
so we write it as c{wi,w 2 ,y)- This function is known as the cost function 
and will be of considerable interest to us. The cost function c(wi,w 2 ,y) 
measures the minimal costs of producing y units of output when factor 
prices are (wi,w 2 ). 

In order to understand the solution to this problem, let us depict the costs 
and the technological constraints facing the firm on the same diagram. The 
isoquants give us the technological constraints—all the combinations of aq 
and X 2 that can produce y. 

Suppose that we want to plot all the combinations of inputs that have 
some given level of cost, C. We can write this as 

W1X1 + W2X2 = C , 


which can be rearranged to give 


x 2 


C Wl 
- x 1 . 

W 2 U>2 


It is easy to see that this is a straight line with a slope of —uq/uq and a 
vertical intercept of C/w 2 - As we let the number C vary we get a whole 
family of isocost lines. Every point on an isocost curve has the same cost, 
C, and higher isocost lines are associated with higher costs. 

Thus our cost-minimization problem can be rephrased as: find the point 
on the isoquant that has the lowest possible isocost line associated with it. 
Such a point is illustrated in Figure 20.1. 

Note that if the optimal solution involves using some of each factor, and 
if the isoquant is a nice smooth curve, then the cost-minimizing point will 
be characterized by a tangency condition: the slope of the isoquant must 
be equal to the slope of the isocost curve. Or, using the terminology of 
Chapter 18, the technical rate of substitution must equal the factor price 
ratio: 


MP 1 (xt,x* 2 ) 

MP 2 {x\,x* 2 ) 


TRS(a 


1 > * t '2 


) = - 


W\ 

w 2 


( 20 . 1 ) 
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Figure 

20.1 



Cost minimization. The choice of factors that minimize pro¬ 
duction costs can be determined by finding the point on the 
isoquant that has the lowest associated isocost curve. 


(If we have a boundary solution where one of the two factors isn’t used, 
this tangency condition need not be met. Similarly, if the production func¬ 
tion has “kinks,” the tangency condition has no meaning. These exceptions 
are just like the situation with the consumer, so we won’t emphasize these 
cases in this chapter.) 

The algebra that lies behind equation (20.1) is not difficult. Consider 
any change in the pattern of production (Axi,Ax 2 ) that keeps output 
constant. Such a change must satisfy 

MP^xl,x* 2 )A Xl + MP 2 (xl,x* 2 )Ax 2 = 0- (20.2) 

Note that Axi and Ax 2 must be of opposite signs; if you increase the 
amount used of factor 1 you must decrease the amount used of factor 2 in 
order to keep output constant. 

If we are at the cost minimum, then this change cannot lower costs, so 
we have 

u>i Axi + W 2 AX 2 > 0. (20.3) 

Now consider the change (—Axi,~Ax 2 ). This also produces a constant 
level of output, and it too cannot lower costs. This implies that 


w \Axi — W 2 A.T 2 > 0. 


(20.4) 
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Putting expressions (20.3) and (20.4) together gives us 

w±Axi + W 2 A.X 2 = 0. (20.5) 

Solving equations (20.2) and (20.5) for Aa^/Aaq gives 

AX 2 w± MP\(x\,X2) 

Ax\ W2 MP 2 (x\,X2)' 

which is just the condition for cost minimization derived above by a geo¬ 
metric argument. 

Note that Figure 20.1 bears a certain resemblance to the solution to 
the consumer-choice problem depicted earlier. Although the solutions look 
the same, they really aren’t the same kind of problem. In the consumer 
problem, the straight line was the budget constraint, and the consumer 
moved along the budget constraint to find the most-preferred position. In 
the producer problem, the isoquant is the technological constraint and the 
producer moves along the isoquant to find the optimal position. 

The choices of inputs that yield minimal costs for the firm will in general 
depend on the input prices and the level of output that the firm wants 
to produce, so we write these choices as Xi(wi,W 2 ,y) and X 2 (wi,W 2 ,y)- 
These are called the conditional factor demand functions, or derived 
factor demands. They measure the relationship between the prices and 
output and the optimal factor choice of the firm, conditional on the firm 
producing a given level of output, y. 

Note carefully the difference between the conditional factor demands and 
the profit-maximizing factor demands discussed in the last chapter. The 
conditional factor demands give the cost-minimizing choices for a given level 
of output; the profit-maximizing factor demands give the profit-maximizing 
choices for a given price of output. 

Conditional factor demands are usually not directly observed; they are 
a hypothetical construct. They answer the question of how much of each 
factor would the firm use if it wanted to produce a given level of output 
in the cheapest way. However, the conditional factor demands are useful 
as a way of separating the problem of determining the optimal level of 
output from the problem of determining the most cost-effective method of 
production. 


EXAMPLE: Minimizing Costs for Specific Technologies 

Suppose that we consider a technology where the factors are perfect com¬ 
plements, so that f(x 1 , 2 : 2 ) = min{a:i, ^ 2 }. Then if we want to produce y 
units of output, we clearly need y units of x\ and y units of X 2 ■ Thus the 
minimal costs of production will be 


c(w 1 ,w 2 , y) = wiy + w 2 y = (uq + w 2 )y. 
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What about the perfect substitutes technology, f(x \,x 2 ) = X\ + a; 2 ? 
Since goods 1 and 2 are perfect substitutes in production it is clear that 
the firm will use whichever is cheaper. Thus the minimum cost of producing 
y units of output will be w\y or W 2 y , whichever is less. In other words: 

c(w!,W 2 ,y) = mm{w 1 y,w 2 y} = mm{w 1 ,w 2 }y. 

Finally, we consider the Cobb-Douglas technology, which is described by 
the formula f(xi,x 2 ) = x®x 2 . In this case we can use calculus techniques 
to show that the cost function will have the form 

c(w\,w 2 ,y) = Kwi +b w 2 +b y °^, 

where K is a constant that depends on a and b. The details of the calcu¬ 
lation are presented in the Appendix. 


20.2 Revealed Cost Minimization 

The assumption that the firm chooses factors to minimize the cost of pro¬ 
ducing output will have implications for how the observed choices change 
as factor prices change. 

Suppose that we observe two sets of prices, (w{,w 2 ) and (w{,w 2 ), and 
the associated choices of the firm, (x\,x 2 ) and (xf,x 2 ). Suppose that each 
of these choices produces the same output level y. Then if each choice is a 
cost-minimizing choice at its associated prices, we must have 

w\x\ + w 2 x 2 < w\x\ + w 2 x 2 


and 

S . S s t . s t 

VJ X X X + w 2 x 2 < w 1 x 1 + w 2 x 2 . 

If the firm is always choosing the cost-minimizing way to produce y units 
of output, then its choices at times t and s must satisfy these inequali¬ 
ties. We will refer to these inequalities as the Weak Axiom of Cost 
Minimization (WACM). 

Write the second equation as 

— w{x\ — w 2 x\ < — w{xl — w 2 x 2 

and add it to the first equation to get 

{w\ - w{)x\ + (w 2 - w 2 )xl < (w{ - w{)x{ + {w 2 - w 2 )x 2 , 

which can be rearranged to give us 

( w\ - w{)(x{ - xj) + (w 2 - w 2 )(x\ - x 2 ) < 0. 
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Using the delta notation to depict the changes in the factor demands 
and factor prices, we have 


AuqAaq + Au^Aa^ < 0. 


This equation follows solely from the assumption of cost-minimizing be¬ 
havior. It implies restrictions on how the firm’s behavior can change when 
input prices change and output remains constant. 

For example, if the price of the first factor increases and the price of the 
second factor stays constant, then Aw 2 = 0, so the inequality becomes 


AuqAaq < 0. 


If the price of factor 1 increases, then this inequality implies that the 
demand for factor 1 must decrease; thus the conditional factor demand 
functions must slope down. 

What can we say about how the minimal costs change as we change the 
parameters of the problem? It is easy to see that costs must increase if 
either factor price increases: if one good becomes more expensive and the 
other stays the same, the minimal costs cannot go down and in general will 
increase. Similarly, if the firm chooses to produce more output and factor 
prices remain constant, the firm’s costs will have to increase. 


20.3 Returns to Scale and the Cost Function 

In Chapter 18 we discussed the idea of returns to scale for the production 
function. Recall that a technology is said to have increasing, decreasing, 
or constant returns to scale as f(tx\,tx 2 ) is greater, less than, or equal to 
tf(x 1 , 2 : 2 ) for all t > 1. It turns out that there is a nice relation between 
the kind of returns to scale exhibited by the production function and the 
behavior of the cost function. 

Suppose first that we have the natural case of constant returns to scale. 
Imagine that we have solved the cost-minimization problem to produce 1 
unit of output, so that we know the unit cost function, c(wi, W 2 , 1). Now 
what is the cheapest way to produce y units of output? Simple: we just 
use y times as much of every input as we were using to produce 1 unit 
of output. This would mean that the minimal cost to produce y units of 
output would just be c(«Ji, u> 2 ,1 )y. In the case of constant returns to scale, 
the cost function is linear in output. 

What if we have increasing returns to scale? In this case it turns out that 
costs increase less than linearly in output. If the firm decides to produce 
twice as much output, it can do so at less than twice the cost, as long as 
the factor prices remain fixed. This is a natural implication of the idea of 
increasing returns to scale: if the firm doubles its inputs, it will more than 
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double its output. Thus if it wants to produce double the output, it will 
be able to do so by using less than twice as much of every input. 

But using twice as much of every input will exactly double costs. So 
using less than twice as much of every input will make costs go up by less 
than twice as much: this is just saying that the cost function will increase 
less than linearly with respect to output. 

Similarly, if the technology exhibits decreasing returns to scale, the cost 
function will increase more than linearly with respect to output. If output 
doubles, costs will more than double. 

These facts can be expressed in terms of the behavior of the average 
cost function. The average cost function is simply the cost per unit to 
produce y units of output: 

AC(y) = 

If the technology exhibits constant returns to scale, then we saw above 
that the cost function had the form c(w±,W 2 ,y) = c(wi,W 2 ,l)y- This 
means that the average cost function will be 


Ant \ c(w 1 ,w 2 ,l)y , 

AC{wi,W 2 ,y) = -= c{wi,W 2 ,l). 

y 

That is, the cost per unit of output will be constant no matter what level 
of output the firm wants to produce. 

If the technology exhibits increasing returns to scale, then the costs will 
increase less than linearly with respect to output, so the average costs will 
be declining in output: as output increases, the average costs of production 
will tend to fall. 

Similarly, if the technology exhibits decreasing returns to scale, then 
average costs will rise as output increases. 

As we saw earlier, a given technology can have regions of increasing, 
constant, or decreasing returns to scale—output can increase more rapidly, 
equally rapidly, or less rapidly than the scale of operation of the firm at 
different levels of production. Similarly, the cost function can increase less 
rapidly, equally rapidly, or more rapidly than output at different levels 
of production. This implies that the average cost function may decrease, 
remain constant, or increase over different levels of output. In the next 
chapter we will explore these possibilities in more detail. 

From now on we will be most concerned with the behavior of the cost 
function with respect to the output variable. For the most part we will 
regard the factor prices as being fixed at some predetermined levels and 
only think of costs as depending on the output choice of the firm. Thus for 
the remainder of the book we will write the cost function as a function of 
output alone: c{y). 
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20.4 Long-Run and Short-Run Costs 

The cost function is defined as the minimum cost of achieving a given level 
of output. Often it is important to distinguish the minimum costs if the 
firm is allowed to adjust all of its factors of production from the minimum 
costs if the firm is only allowed to adjust some of its factors. 

We have defined the short run to be a time period where some of the 
factors of production must be used in a fixed amount. In the long run, 
all factors are free to vary. The short-run cost function is defined as 
the minimum cost to produce a given level of output, only adjusting the 
variable factors of production. The long-run cost function gives the 
minimum cost of producing a given level of output, adjusting all of the 
factors of production. 

Suppose that in the short run factor 2 is fixed at some predetermined 
level X 2 , but in the long run it is free to vary. Then the short-run cost 
function is defined by 


c s (y, x 2 ) = minroiXi + w 2 x 2 

XI 

such that f(xi,x 2 ) = y- 

Note that in general the minimum cost to produce y units of output in the 
short run will depend on the amount and cost of the fixed factor that is 
available. 

In the case of two factors, this minimization problem is easy to solve: we 
just find the smallest amount of X\ such that f(xi,x 2 ) = y- However, if 
there are many factors of production that are variable in the short run the 
cost-minimization problem will involve more elaborate calculation. 

The short-run factor demand function for factor 1 is the amount of fac¬ 
tor 1 that minimizes costs. In general it will depend on the factor prices 
and on the levels of the fixed factors as well, so we write the short-run 
factor demands as 

xi = xl(wi,W2,X2,y) 

x 2 = x 2 . 

These equations just say, for example, that if the building size is fixed 
in the short run, then the number of workers that a firm wants to hire at 
any given set of prices and output choice will typically depend on the size 
of the building. 

Note that by definition of the short-run cost function 

c s (y,x 2 ) = w 1 x s 1 (w 1 ,w 2 ,x 2 ,y) + w 2 x 2 . 

This just says that the minimum cost of producing output y is the cost 
associated with using the cost-minimizing choice of inputs. This is true by 
definition but turns out to be useful nevertheless. 
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The long-run cost function in this example is defined by 
c(y) = min uqaq + W 2 X 2 

Xl,X2 


such that /(£ 1 , 2 : 2 ) = V- 

Here both factors are free to vary. Long-run costs depend only on the level 
of output that the firm wants to produce along with factor prices. We write 
the long-run cost function as c(y), and write the long-run factor demands 
as 

x\ = xi(wi,w 2 ,y) 

X 2 = X 2 (w 1 ,W 2 ,y). 

We can also write the long-run cost function as 

c(y) = w 1 xi(wi,w 2 ,y) + W 2 X 2 (w 1 ,W 2 ,y). 

Just as before, this simply says that the minimum costs are the costs that 
the firm gets by using the cost-minimizing choice of factors. 

There is an interesting relation between the short-run and the long-run 
cost functions that we will use in the next chapter. For simplicity, let us 
suppose that factor prices are fixed at some predetermined levels and write 
the long-run factor demands as 


xi = xi(y) 
X2 = x 2 {y). 


Then the long-run cost function can also be written as 

c{y) = c s (y, x 2 (y))- 

To see why this is true, just think about what it means. The equation says 
that the minimum costs when all factors are variable is just the minimum 
cost when factor 2 is fixed at the level that minimizes long-run costs. It fol¬ 
lows that the long-run demand for the variable factor—the cost-minimizing 
choice—is given by 


xi(wi,ui 2 , 2 /) = Xi(wi, w 2 ,x 2 {y),y). 

This equation says that the cost-minimizing amount of the variable factor 
in the long run is that amount that the firm would choose in the short 
run—if it happened to have the long-run cost-minimizing amount of the 
fixed factor. 
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20.5 Fixed and Quasi-Fixed Costs 


In Chapter 19 we made the distinction between fixed factors and quasi- 
fixed factors. Fixed factors are factors that must receive payment whether 
or not any output is produced. Quasi-fixed factors must be paid only if the 
firm decides to produce a positive amount of output. 

It is natural to define fixed costs and quasi-fixed costs in a similar man¬ 
ner. Fixed costs are costs associated with the fixed factors: they are 
independent of the level of output, and, in particular, they must be paid 
whether or not the firm produces output. Quasi-fixed costs are costs 
that are also independent of the level of output, but only need to be paid 
if the firm produces a positive amount of output. 

There are no fixed costs in the long run, by definition. However, there 
may easily be quasi-fixed costs in the long run. If it is necessary to spend 
a fixed amount of money before any output at all can be produced, then 
quasi-fixed costs will be present. 


20.6 Sunk Costs 

Sunk costs are another kind of fixed costs. The concept is best explained by 
example. Suppose that you have decided to lease an office for a year. The 
monthly rent that you have committed to pay is a fixed cost, since you are 
obligated to pay it regardless of the amount of output you produce. Now 
suppose that you decide to refurbish the office by painting it and buying 
furniture. The cost for paint is a fixed cost, but it is also a sunk cost since 
it is a payment that is made and cannot be recovered. The cost of buying 
the furniture, on the other hand, is not entirely sunk, since you can resell 
the furniture when you are done with it. It’s only the difference between 
the cost of new and used furniture that is sunk. 

To spell this out in more detail, suppose that you borrow $20,000 at the 
beginning of the year at, say, 10 percent interest. You sign a lease to rent 
an office and pay $12,000 in advance rent for next year. You spend $6,000 
on office furniture and $2,000 to paint the office. At the end of the year 
you pay back the $20,000 loan plus the $2,000 interest payment and sell 
the used office furniture for $5,000. 

Your total sunk costs consist of the $12,000 rent, the $2,000 of interest, 
the $2,000 of paint, but only $1,000 for the furniture, since $5,000 of the 
orginal furniture expenditure is recoverable. 

The difference between sunk costs and recoverable costs can be quite 
significant. A $100,000 expenditure to purchase five light trucks sounds 
like a lot of money, but if they can later be sold on the used truck market 
for $80,000, the actual sunk cost is only $20,000. A $100,000 expenditure 
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on a custom-made press for stamping out gizmos that has a zero resale 
value is quite different; in this case the entire expenditure is sunk. 

The best way to keep these issues straight is to make sure to treat all 
expenditures on a flow basis: how much does it cost to do business for 
a year? That way, one is less likely to forget the resale value of capital 
equipment and more likely to keep the distinction between sunk costs and 
recoverable costs clear. 


Summary 

1. The cost function, c(wi,w 2 ,y), measures the minimum costs of produc¬ 
ing a given level of output at given factor prices. 

2. Cost-minimizing behavior imposes observable restrictions on choices that 
firms make. In particular, conditional factor demand functions will be neg¬ 
atively sloped. 

3. There is an intimate relationship between the returns to scale exhibited 
by the technology and the behavior of the cost function. Increasing returns 
to scale implies decreasing average cost, decreasing returns to scale implies 
increasing average cost, and constant returns to scale implies constant av¬ 
erage cost. 

4. Sunk costs are costs that are not recoverable. 

REVIEW QUESTIONS 


1. Prove that a profit-maximizing firm will always minimize costs. 

2 . If a firm is producing where MP\/w\ > MP 2 /w 2 , what can it do to 
reduce costs but maintain the same output? 

3. Suppose that a cost-minimizing firm uses two inputs that are perfect 
substitutes. If the two inputs are priced the same, what do the conditional 
factor demands look like for the inputs? 

4. The price of paper used by a cost-minimizing firm increases. The firm 
responds to this price change by changing its demand for certain inputs, 
but it keeps its output constant. What happens to the firm’s use of paper? 

5. If a firm uses n inputs (n > 2), what inequality does the theory of 
revealed cost minimization imply about changes in factor prices (A wi) and 
the changes in factor demands (Ax,) for a given level of output? 
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APPENDIX 

Let us study the cost-minimization problem posed in the text using the opti¬ 
mization techniques introduced in Chapter 5. The problem is a constrained- 
minimization problem of the form 

min W1X1 + 102*2 

xi ,x 2 


such that f(xi, X2) = y- 

Recall that we had several techniques to solve this kind of problem. One way 
was to substitute the constraint into the objective function. This can still be 
used when we have a specific functional form for /(* 1, £2), but isn’t much use in 
the general case. 

The second method was the method of Lagrange multipliers and that works 
fine. To apply this method we set up the Lagrangian 

L = wiii + w 2 x 2 - A(/(*i,* 2 ) - y) 


and differentiate with respect to *1, X2 and A. 
conditions: 


wi — A 


W2 — A 


df(xi,x 2 ) 

dxi 

df{xi,x 2 ) 

8x2 


= 0 
= 0 


This gives us the first-order 


f(xi,x 2 ) -y = 0 . 


The last condition is simply the constraint. We can rearrange the first two 
equations and divide the first equation by the second equation to get 


wi _ df(xi,X2)/dxi 

w 2 df(x 1 ,x 2 )/dx 2 ' 


Note that this is the same first-order condition that we derived in the text: the 
technical rate of substitution must equal the factor price ratio. 

Let’s apply this method to the Cobb-Douglas production function: 

/(* i,x 2 ) = x 1 x b 2 . 


The cost-minimization problem is then 


min wia:i + 102*2 

x\,X2 


such that *“*2 = y- 

Here we have a specific functional form, and we can solve it using either the 
substitution method or the Lagrangian method. The substitution method would 
involve first solving the constraint for *2 as a function of *1: 
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and then substituting this into the objective function to get the unconstrained 
minimization problem 

• / -n \ 1/6 

minwiii + ti)2 yx 1 I 

XI V ' 

We could now differentiate with respect to xi and set resulting derivative equal 
to zero, as usual. The resulting equation can be solved to get X\ as a function 
of wi, w 2, and y , to get the conditional factor demand for x\. This isn’t hard to 
do, but the algebra is messy, so we won’t write down the details. 

We will, however, solve the Lagrangian problem. The three first-order condi¬ 
tions are 

w\ = Xax 1 x 2 

\j a 6 — 1 

w 2 = Xbx 1 x 2 

a b 

y = x 1 x 2 - 

Multiply the first equation by x\ and the second equation by X2 to get 
wixi = \ax\x\ = Xay 
w 2 X2 = \bxix 2 = \by, 

so that 


5ft 1 
C3 | 
K 

II 

( 20 . 6 ) 

Wi 

X2 = X l ~ 

W 2 

( 20 . 7 ) 


Now we use the third equation to solve for A. Substituting the solutions for *1 
and X2 into the third first-order condition, we have 


f^oyY’ (My\ 
V Wl ) V u>2 ) 


= y- 


We can solve this equation for A to get the rather formidable expression 

A / — ai—b a b 1 — a — b \—tt 

= (a b w 1 w 2 y )“+*>, 


which, along with equations ( 20 . 6 ) and ( 20 . 7 ), gives us our final solutions for xi 
and X2 . These factor demand functions will take the form 

/a\'STh ~ b b i 

xi{wi,w 2 ,y) = ( w? +b w2 +b y°-+ b 

x 2 (wi,W2,y) = a+b w? +b w£ +b y^&. 

The cost function can be found by writing down the costs when the firm makes 
the cost-minimizing choices. That is, 


c(wi,W2,y) = wixi(wi,W2, y) + W2X2(wi,W2,y). 
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Some tedious algebra shows that 


c(wi,W2,y) 


/«\i /a\ a+b 

\b) + \b) 




y a + b - 


(Don’t worry, this formula won’t be on the final exam. It is presented only to 
demonstrate how to get an explicit solution to the cost-minimization problem by 
applying the method of Lagrange multipliers.) 

Note that costs will increase more than, equal to, or less than linearly with 
output as a + b is less than, equal to, or greater than 1 . This makes sense since 
the Cobb-Douglas technology exhibits decreasing, constant, or increasing returns 
to scale depending on the value of a + b. 




CHAPTER 



COST 

CURVES 


In the last chapter we described the cost-minimizing behavior of a firm. 
Here we continue that investigation through the use of an important geo¬ 
metric construction, the cost curve. Cost curves can be used to depict 
graphically the cost function of a firm and are important in studying the 
determination of optimal output choices. 


21.1 Average Costs 

Consider the cost function described in the last chapter. This is the function 
c(w i, W 2 , y ) that gives the minimum cost of producing output level y when 
factor prices are (uq, w^)- In the rest of this chapter we will take the factor 
prices to be fixed so that we can write cost as a function of y alone, c(y). 

Some of the costs of the firm are independent of the level of output of 
the firm. As we’ve seen in Chapter 20, these are the fixed costs. Fixed 
costs are the costs that must be paid regardless of what level of output the 
firm produces. For example, the firm might have mortgage payments that 
are required no matter what its level of output. 
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Other costs change when output changes: these are the variable costs. 
The total costs of the firm can always be written as the sum of the variable 
costs, Cv(y), and the fixed costs, F: 

c{y) = c v (y) + F. 

The average cost function measures the cost per unit of output. The 
average variable cost function measures the variable costs per unit of 
output, and the average fixed cost function measures the fixed costs 
per unit output. By the above equation: 

AC(y) = ^ = + - = AVC{y) + AFC(y) 

y y y 

where AVC(y) stands for average variable costs and AFC(y) stands for 
average fixed costs. What do these functions look like? The easiest one is 
certainly the average fixed cost function: when y = 0 it is infinite, and as 
y increases the average fixed cost decreases toward zero. This is depicted 
in Figure 21.1A. 



Construction of the average cost curve. (A) The average 
fixed costs decrease as output is increased. (B) The average vari¬ 
able costs eventually increase as output is increased. (C) The 
combination of these two effects produces a U-shaped average 
cost curve. 


Figure 

21.1 


Consider the variable cost function. Start at a zero level of output and 
consider producing one unit. Then the average variable costs at y = 1 is 
just the variable cost of producing this one unit. Now increase the level 
of production to 2 units. We would expect that, at worst, variable costs 
would double, so that average variable costs would remain constant. If 
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we can organize production in a more efficient way as the scale of output 
is increased, the average variable costs might even decrease initially. But 
eventually we would expect the average variable costs to rise. Why? If fixed 
factors are present, they will eventually constrain the production process. 

For example, suppose that the fixed costs are due to the rent or mortgage 
payments on a building of fixed size. Then as production increases, average 
variable costs—the per-unit production costs—may remain constant for a 
while. But as the capacity of the building is reached, these costs will rise 
sharply, producing an average variable cost curve of the form depicted in 
Figure 21.IB. 

The average cost curve is the sum of these two curves; thus it will have 
the U-shape indicated in Figure 21.1C. The initial decline in average costs 
is due to the decline in average fixed costs; the eventual increase in average 
costs is due to the increase in average variable costs. The combination of 
these two effects yields the U-shape depicted in the diagram. 


21.2 Marginal Costs 


There is one more cost curve of interest: the marginal cost curve. The 
marginal cost curve measures the change in costs for a given change in 
output. That is, at any given level of output y , we can ask how costs will 
change if we change output by some amount Ay: 


A c(y) _ c(y + Ay) - c{y) 
Ay Ay 


MC(y) 


We could just as well write the definition of marginal costs in terms of 
the variable cost function: 



c v (y + Ay) - c v (y) 
Ay 


This is equivalent to the first definition, since c(y) = c v (y ) + F and the 
fixed costs, F, don’t change as y changes. 

Often we think of Ay as being one unit of output, so that marginal 
cost indicates the change in our costs if we consider producing one more 
discrete unit of output. If we are thinking of the production of a discrete 
good, then marginal cost of producing y units of output is just c(y) — 
c{y — 1). This is often a convenient way to think about marginal cost, 
but is sometimes misleading. Remember, marginal cost measures a rate of 
change: the change in costs divided by a change in output. If the change 
in output is a single unit, then marginal cost looks like a simple change 
in costs, but it is really a rate of change as we increase the output by one 
unit. 
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How can we put this marginal cost curve on the diagram presented above? 
First we note the following. The variable costs are zero when zero units 
of output are produced, by definition. Thus for the first unit of output 
produced 

MC( 1) = + F ~ F = ^111 = AVC (l). 

Thus the marginal cost for the first small unit of amount equals the average 
variable cost for a single unit of output. 

Now suppose that we are producing in a range of output where average 
variable costs are decreasing. Then it must be that the marginal costs are 
less than the average variable costs in this range. For the way that you 
push an average down is to add in numbers that are less than the average. 

Think about a sequence of numbers representing average costs at differ¬ 
ent levels of output. If the average is decreasing, it must be that the cost 
of each additional unit produced is less than average up to that point. To 
make the average go down, you have to be adding additional units that are 
less than the average. 

Similarly, if we are in a region where average variable costs are rising, 
then it must be the case that the marginal costs are greater than the average 
variable costs—it is the higher marginal costs that are pushing the average 
up. 

Thus we know that the marginal cost curve must lie below the average 
variable cost curve to the left of its minimum point and above it to the 
right. This implies that the marginal cost curve must intersect the average 
variable cost curve at its minimum point. 

Exactly the same kind of argument applies for the average cost curve. If 
average costs are falling, then marginal costs must be less than the average 
costs and if average costs are rising the marginal costs must be larger than 
the average costs. These observations allow us to draw in the marginal cost 
curve as in Figure 21.2. 

To review the important points: 

• The average variable cost curve may initially slope down but need not. 
However, it will eventually rise, as long as there are fixed factors that 
constrain production. 

• The average cost curve will initially fall due to declining fixed costs but 
then rise due to the increasing average variable costs. 

• The marginal cost and average variable cost are the same at the first 
unit of output. 

• The marginal cost curve passes through the minimum point of both the 
average variable cost and the average cost curves. 
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Figure 

21.2 



Cost curves. The average cost curve ( AC ), the average vari¬ 
able cost curve ( AVC ), and the marginal cost curve ( MC ). 


21.3 Marginal Costs and Variable Costs 

There are also some other relationships between the various curves. Here is 
one that is not so obvious: it turns out that the area beneath the marginal 
cost curve up to y gives us the variable cost of producing y units of output. 
Why is that? 

The marginal cost curve measures the cost of producing each additional 
unit of output. If we add up the cost of producing each unit of output we 
will get the total costs of production—except for fixed costs. 

This argument can be made rigorous in the case where the output good 
is produced in discrete amounts. First, we note that 

Cv(y) = [cv{y) ~ c v (y - 1)] + [c v (y - 1) - c v (y - 2] + 

-1- [c„(l) - c„(0)]. 

This is true since c„(0) = 0 and all the middle terms cancel out; that is, the 
second term cancels the third term, the fourth term cancels the fifth term, 
and so on. But each term in this sum is the marginal cost at a different 
level of output: 

c v (y) = MC(y - 1) + MC{y - 2) + ■ • • + MC( 0). 
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Thus each term in the sum represents the area of a rectangle with height 
MC(y) and base of 1. Summing up all these rectangles gives us the area 
under the marginal cost curve as depicted in Figure 21.3. 


MC 


MC 


Variable costs 


y 


Marginal cost and variable costs. The area under the 
marginal cost curve gives the variable costs. 


Figure 

21.3 


EXAMPLE: Specific Cost Curves 

Let’s consider the cost function c(y) = y 2 + 1. We have the following 
derived cost curves: 

• variable costs: c v (y) = y 2 

• fixed costs: Cf(y) = 1 

• average variable costs: AVC(y) = y 2 /y = y 

• average fixed costs: AFC(y) = 1/y 

y 2 + 1 1 

• average costs: AC(y) = -= y H— 

y V 


• marginal costs: MC(y) = 2 y 
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Figure 

21.4 


These are all obvious except for the last one, which is also obvious if you 
know calculus. If the cost function is c(y ) = y 2 + F, then the marginal 
cost function is given by MC(y) = 2 y. If you don’t know this fact already, 
memorize it, because you’ll use it in the exercises. 

What do these cost curves look like? The easiest way to draw them is 
first to draw the average variable cost curve, which is a straight line with 
slope 1. Then it is also simple to draw the marginal cost curve, which is a 
straight line with slope 2. 

The average cost curve reaches its minimum where average cost equals 
marginal cost, which says 

y+-=2y, 

y 

which can be solved to give y m i n = 1. The average cost at y = 1 is 2, which 
is also the marginal cost. The final picture is given in Figure 21.4. 



Cost curves. The cost curves for c(y) = y 2 + 1. 


EXAMPLE: Marginal Cost Curves for Two Plants 

Suppose that you have two plants that have two different cost functions, 
Ci(yi) and 02 ( 2 / 2 )- You want to produce y units of output in the cheapest 
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way. In general, you will want to produce some amount of output in each 
plant. The question is, how much should you produce in each plant? 

Set up the minimization problem: 

min ci (j/i) + c 2 (y 2 ) 

Vl,V2 

such that j/i + 2/2 —y- 

Now how do you solve it? It turns out that at the optimal division of 
output between the two plants we must have the marginal cost of producing 
output at plant 1 equal to the marginal cost of producing output at plant 
2. In order to prove this, suppose the marginal costs were not equal; then 
it would pay to shift a small amount of output from the plant with higher 
marginal costs to the plant with lower marginal costs. If the output division 
is optimal, then switching output from one plant to the other can’t lower 
costs. 

Let c(y) be the cost function that gives the cheapest way to produce 
y units of output—that is, the cost of producing y units of output given 
that you have divided output in the best way between the two plants. The 
marginal cost of producing an extra unit of output must be the same no 
matter which plant you produce it in. 

We depict the two marginal cost curves, MC\(y\) and in Fig¬ 

ure 21.5. The marginal cost curve for the two plants taken together is just 
the horizontal sum of the two marginal cost curves, as depicted in Figure 
21.5C. 



Marginal costs for a firm with two plants. The overall 
marginal cost curve on the right is the horizontal sum of the 
marginal cost curves for the two plants shown on the left. 


Figure 

21.5 
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For any fixed level of marginal costs, say c, we will produce y\ and 
such that MCi(yi) = MC(y%) = c, and we will thus have yl + y% units of 
output produced. Thus the amount of output produced at any marginal 
cost c is just the sum of the outputs where the marginal cost of plant 1 
equals c and the marginal cost of plant 2 equals c: the horizontal sum of 
the marginal cost curves. 


21.4 Cost Curves for Online Auctions 

We explored an auction model of search engine advertising in Chapter 17. 
Recall the setup. When a user enters a query into a search engine, the 
query is matched with keywords chosen by advertisers. Those advertisers 
whose keywords match the query are entered into an auction. The highest 
bidder gets the most prominent position, the second-highest bidder gets 
the second most prominent position and so on. The more prominent the 
position, the more clicks the ad tends to get, other things (such as ad 
quality) being equal. 

In the auction examined earlier, it was assumed that each advertiser 
could choose a separate bid for each keyword. In practice, an advertiser 
chooses a single bid that is used in all auctions in which they participate. 
The fact that prices are determined by an auction is not all that impor¬ 
tant from an advertiser’s point of view. What matters is the relationship 
between the number of clicks the ad gets, x, and the cost of those clicks, 
c(x). 

This is just our old friend the total cost function. Once an advertiser 
knows the cost function, it can determine how many clicks it wants to buy. 
Letting v represent the value of a click, the profit maximization problem is 

max vx — c(x). 

X 

As we have seen, the optimal solution entails setting value equal to mar¬ 
ginal cost. Once the advertiser determines the profit-maximizing number 
of clicks, it can choose a bid that will yield that many clicks. 

This process is shown in Figure 21.6, which is a standard plot of average 
cost and marginal cost, with the addition of a new line illustrating the bid. 

How does the advertising discover its cost curve? One answer is that 
the advertiser can experiment with different bids and record the resulting 
number of clicks and cost. Or, the search engine can provide an estimate 
of the cost function by using the information from the auctions. 

Suppose, for example, we want to estimate what would happen if an 
advertiser increases its bid per click from 50 cents to 80 cents. The search 
engine can look at each auction in which the advertiser participates to 
how its position changes and how many new clicks it could be expected to 
receive in the new position. 
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Click-cost curves. The profit-maximizing number of clicks is 
where value equals marginal cost, which determines the appro¬ 
priate bid and average cost per click. 


21.5 Long-Run Costs 

In the above analysis, we have regarded the firm’s fixed costs as being the 
costs that involve payments to factors that it is unable to adjust in the short 
run. In the long run a firm can choose the level of its “fixed” factors—they 
are no longer fixed. 

Of course, there may still be quasi-fixed factors in the long run. That 
is, it may be a feature of the technology that some costs have to be paid 
to produce any positive level of output. But in the long run there are no 
fixed costs, in the sense that it is always possible to produce zero units of 
output at zero costs—that is, it is always possible to go out of business. If 
quasi-fixed factors are present in the long run, then the average cost curve 
will tend to have a U-shape, just as in the short run. But in the long run 
it will always be possible to produce zero units of output at a zero cost, by 
definition of the long run. 

Of course, what constitutes the long run depends on the problem we are 
analyzing. If we are considering the fixed factor to be the size of the plant, 
then the long run will be how long it would take the firm to change the 
size of its plant. If we are considering the fixed factor to be the contractual 
obligations to pay salaries, then the long run would be how long it would 
take the firm to change the size of its work force. 

Just to be specific, let’s think of the fixed factor as being plant size and 


Figure 

21.6 
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denote it by k. The firm’s short-run cost function, given that it has a plant 
of k square feet, will be denoted by c s (y,k), where the s subscript stands 
for “short run.” (Here k is playing the role of X 2 in Chapter 20.) 

For any given level of output, there will be some plant size that is the 
optimal size to produce that level of output. Let us denote this plant size 
by k{y). This is the firm’s conditional factor demand for plant size as a 
function of output. (Of course, it also depends on the prices of plant size 
and other factors of production, but we have suppressed these arguments.) 
Then, as we’ve seen in Chapter 20, the long-run cost function of the firm 
will be given by c s {y,k{y)). This is the total cost of producing an output 
level y , given that the firm is allowed to adjust its plant size optimally. 
The long-run cost function of the firm is just the short-run cost function 
evaluated at the optimal choice of the fixed factors: 

c(y ) = c s (y,k(y)). 

Let us see how this looks graphically. Pick some level of output y*, and 
let k* = k{y*) be the optimal plant size for that level of output. The short- 
run cost function for a plant of size k* will be given by c s {y : k *), and the 
long-run cost function will be given by c{y) = c s (y, k{y)), just as above. 

Now, note the important fact that the short-run cost to produce output 
y must always be at least as large as the long-run cost to produce y. Why? 
In the short run the firm has a fixed plant size, while in the long run the 
firm is free to adjust its plant size. Since one of its long-run choices is 
always to choose the plant size k* , its optimal choice to produce y units of 
output must have costs at least as small as c(y,k*). This means that the 
firm must be able to do at least as well by adjusting plant size as by having 
it fixed. Thus 

c{y) < c s (y, k*) 


for all levels of y. 

In fact, at one particular level of y , namely y*, we know that 

c{y*) = c s {y*,k*). 

Why? Because at y* the optimal choice of plant size is k*. So at y*, the 
long-run costs and the short-run costs are the same. 

If the short-run cost is always greater than the long-run cost and they 
are equal at one level of output, then this means that the short-run and the 
long-run average costs have the same property: AC{y ) < AC s {y,k*) and 
AC{y*) = AC s {y* ,k*). This implies that the short-run average cost curve 
always lies above the long-run average cost curve and that they touch at 
one point, y*. Thus the long-run average cost curve {LAC) and the short- 
run average cost curve {SAC) must be tangent at that point, as depicted 
in Figure 21.7. 
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Short-run and long-run average costs. The short-run av¬ 
erage cost curve must be tangent to the long-run average cost 
curve. 


We can do the same sort of construction for levels of output other than 
y*. Suppose we pick outputs j/i, y 2 ,..., y n and accompanying plant sizes 
fci = fc(yi), k 2 = k(y 2 ),..., k n = k(y n ). Then we get a picture like that in 
Figure 21.8. We summarize Figure 21.8 by saying that the long-run average 
cost curve is the lower envelope of the short-run average cost curves. 


21.6 Discrete Levels of Plant Size 

In the above discussion we have implicitly assumed that we can choose 
a continuous number of different plant sizes. Thus each different level of 
output has a unique optimal plant size associated with it. But we can also 
consider what happens if there are only a few different levels of plant size 
to choose from. 

Suppose, for example, that we have four different choices, k±, k 2 , k 3 , and 
Aq. We have depicted the four different average cost curves associated with 
these plant sizes in Figure 21.9. 

How can we construct the long-run average cost curve? Well, remember 
the long-run average cost curve is the cost curve you get by adjusting k 
optimally. In this case that isn’t hard to do: since there are only four 
different plant sizes, we just see which one has the lowest costs associated 
with it and pick that plant size. That is, for any level of output y , we just 


Figure 

21.7 
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Figure 

21.8 



Short-run and long-run average costs. The long-run av¬ 
erage cost curve is the envelope of the short-run average cost 
curves. 


choose the plant size that gives us the minimum cost of producing that 
output level. 

Thus the long-run average cost curve will be the lower envelope of the 
short-run average costs, as depicted in Figure 21.9. Note that this figure has 
qualitatively the same implications as Figure 21.8: the short-run average 
costs always are at least as large as the long-run average costs, and they 
are the same at the level of output where the long-run demand for the fixed 
factor equals the amount of the fixed factor that you have. 


21.7 Long-Run Marginal Costs 

We’ve seen in the last section that the long-run average cost curve is the 
lower envelope of the short-run average cost curves. What are the impli¬ 
cations of this for marginal costs? Let’s first consider the case where there 
are discrete levels of plant size. In this situation the long-run marginal 
cost curve consists of the appropriate pieces of the short-run marginal cost 
curves, as depicted in Figure 21.10. For each level of output, we see which 
short-run average cost curve we are operating on and then look at the 
marginal cost associated with that curve. 
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Discrete levels of plant size. The long-run cost curve is the 
lower envelope of the short-run curves, just as before. 


This has to hold true no matter how many different plant sizes there are, 
so the picture for the continuous case looks like Figure 21.11. The long-run 
marginal cost at any output level y has to equal the short-run marginal 
cost associated with the optimal level of plant size to produce y. 


Summary 

1. Average costs are composed of average variable costs plus average fixed 
costs. Average fixed costs always decline with output, while average vari¬ 
able costs tend to increase. The net result is a U-shaped average cost 
curve. 

2. The marginal cost curve lies below the average cost curve when average 
costs are decreasing, and above when they are increasing. Thus marginal 
costs must equal average costs at the point of minimum average costs. 

3. The area under the marginal cost curve measures the variable costs. 

4. The long-run average cost curve is the lower envelope of the short-run 
average cost curves. 


Figure 

21.9 
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Figure 

21.10 



ACj AC 2 AC 3 

Long-run marginal costs. When there are discrete levels of 
the fixed factor, the firm will choose the amount of the fixed 
factor to minimize average costs. Thus the long-run marginal 
cost curve will consist of the various segments of the short-run 
marginal cost curves associated with each different level of the 
fixed factor. 


REVIEW QUESTIONS 


1. Which of the following are true? (1) Average fixed costs never increase 
with output; (2) average total costs are always greater than or equal to 
average variable costs; (3) average cost can never rise while marginal costs 
are declining. 

2. A firm produces identical outputs at two different plants. If the marginal 
cost at the first plant exceeds the marginal cost at the second plant, how 
can the firm reduce costs and maintain the same level of output? 

3. True or false? In the long run a firm always operates at the mini¬ 
mum level of average costs for the optimally sized plant to produce a given 
amount of output. 
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Long-run marginal costs. The relationship between the 
long-run and the short-run marginal costs with continuous levels 
of the fixed factor. 


Figure 

21.11 


APPENDIX 

In the text we claimed that average variable cost equals marginal cost for the 
first unit of output. In calculus terms this becomes 

lim = i im 

H -0 y y^O 

The left-hand side of this expression is not defined at y = 0. But its limit is 
defined, and we can compute it using l’Hopital’s rule, which states that the limit 
of a fraction whose numerator and denominator both approach zero is given by 
the limit of the derivatives of the numerator and the denominator. Applying this 
rule, we have 

li m c "(^) = limy^o dc v (y)/dy = c'(0) 
y —>o y lim v ^ 0 dy/dy 1 

which establishes the claim. 

We also claimed that the area under the marginal cost curve gave us variable 
cost. This is easy to show using the fundamental theorem of calculus. Since 

dc v (y) 
dy 


MC(y) 
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we know that the area under the marginal cost curve is 


Cv{y) 


dc v (x) 

dx 


dx = c v (y ) 


Cv(0) = Cv{y). 


The discussion of long-run and short-run marginal cost curves is all pretty clear 
geometrically, but what does it mean economically? It turns out that the calculus 
argument gives the nicest intuition. The argument is simple. The marginal cost 
of production is just the change in cost that arises from changing output. In the 
short run we have to keep plant size (or whatever) fixed, while in the long run 
we are free to adjust it. So the long-run marginal cost will consist of two pieces: 
how costs change holding plant size fixed plus how costs change when plant size 
adjusts. But if the plant size is chosen optimally, this last term has to be zero! 
Thus the long-run and the short-run marginal costs have to be the same. 

The mathematical proof involves the chain rule. Using the definition from the 
text: 

c(y ) = c s (y,k(y)). 

Differentiating with respect to y gives 


dc(y) = dc a (y,k) dc s (y,k) dk(y) 

dy dy dk dy 


If we evaluate this at a specific level of output y* and its associated optimal 
plant size k* = k(y*), we know that 


dc s (y*,k*) = 
dk 

because that is the necessary first-order condition for k* to be the cost-minimizing 
plant size at y* . Thus the second term in the expression cancels out and all that 
we have left is the short-run marginal cost: 

dc(y*) = dc s (y*,k*) 
dy dy 









CHAPTER 



FIRM 

SUPPLY 


In this chapter we will see how to derive the supply curve of a competitive 
firm from its cost function using the model of profit maximization. The 
first thing we have to do is to describe the market environment in which 
the firm operates. 


22.1 Market Environments 

Every firm faces two important decisions: choosing how much it should pro¬ 
duce and choosing what price it should set. If there were no constraints on 
a profit-maximizing firm, it would set an arbitrarily high price and produce 
an arbitrarily large amount of output. But no firm exists in such an un¬ 
constrained environment. In general, the firm faces two sorts of constraints 
on its actions. 

First, it faces the technological constraints summarized by the pro¬ 
duction function. There are only certain feasible combinations of inputs 
and outputs, and even the most profit-hungry firm has to respect the re¬ 
alities of the physical world. We have already discussed how we can sum¬ 
marize the technological constraints, and we’ve seen how the technological 
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constraints lead to the economic constraints summarized by the cost 
function. 

But now we bring in a new constraint—or at least an old constraint 
from a different perspective. This is the market constraint. A firm can 
produce whatever is physically feasible, and it can set whatever price it 
wants ... but it can only sell as much as people are willing to buy. 

If it sets a certain price p it will sell a certain amount of output x. We 
call the relationship between the price a firm sets and the amount that it 
sells the demand curve facing the firm. 

If there were only one firm in the market, the demand curve facing the 
firm would be very simple to describe: it is just the market demand curve 
described in earlier chapters on consumer behavior. For the market demand 
curve measures how much of the good people want to buy at each price. 
Thus the demand curve summarizes the market constraints facing a firm 
that has a market all to itself. 

But if there are other firms in the market, the constraints facing an 
individual firm will be different. In this case, the firm has to guess how the 
other firms in the market will behave when it chooses its price and output. 

This is not an easy problem to solve, either for firms or for economists. 
There are a lot of different possibilities, and we will try to examine them 
in a systematic way. We’ll use the term market environment to describe 
the ways that firms respond to each other when they make their pricing 
and output decisions. 

In this chapter we’ll examine the simplest market environment, that 
of pure competition. This is a good comparison point for many other 
environments, and it is of considerable interest in its own right. First let’s 
give the economist’s definition of pure competition, and then we’ll try to 
justify it. 


22.2 Pure Competition 

To a lay person, “competition” has the connotation of intense rivalry. 
That’s why students are often surprised that the economist’s definition 
of competition seems so passive: we say that a market is purely compet¬ 
itive if each firm assumes that the market price is independent of its own 
level of output. Thus, in a competitive market, each firm only has to worry 
about how much output it wants to produce. Whatever it produces can 
only be sold at one price: the going market price. 

In what sort of environment might this be a reasonable assumption for a 
firm to make? Well, suppose that we have an industry composed of many 
firms that produce an identical product, and that each firm is a small part 
of the market. A good example would be the market for wheat. There 
are thousands of wheat farmers in the United States, and even the largest 
of them produces only an infinitesimal fraction of the total supply. It is 
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reasonable in this case for any one firm in the industry to take the market 
price as being predetermined. A wheat farmer doesn’t have to worry about 
what price to set for his wheat—if he wants to sell any at all, he has to sell 
it at the market price. He is a price taker: the price is given as far as he 
is concerned; all he has to worry about is how much to produce. 

This kind of situation—an identical product and many small firms—is a 
classic example of a situation where price-taking behavior is sensible. But 
it is not the only case where price-taking behavior is possible. Even if there 
are only a few firms in the market, they may still treat the market price as 
being outside their control. 

Think of a case where there is a fixed supply of a perishable good: say 
fresh fish or cut flowers in a marketplace. Even if there are only 3 or 4 
firms in the market, each firm may have to take the other firms’ prices as 
given. If the customers in the market only buy at the lowest price, then 
the lowest price being offered is the market price. If one of the other firms 
wants to sell anything at all, it will have to sell at the market price. So 
in this sort of situation competitive behavior—taking the market price as 
outside of your control—seems plausible as well. 

We can describe the relationship between price and quantity perceived 
by a competitive firm in terms of a diagram as in Figure 22.1. As you can 
see, this demand curve is very simple. A competitive firm believes that it 
will sell nothing if it charges a price higher than the market price. If it sells 
at the market price, it can sell whatever amount it wants, and if it sells 
below the market price, it will get the entire market demand at that price. 

As usual we can think of this kind of demand curve in two ways. If we 
think of quantity as a function of price, this curve says that you can sell 
any amount you want at or below the market price. If we think of price 
as a function of quantity, it says that no matter how much you sell, the 
market price will be independent of your sales. 

(Of course, this doesn’t have to be true for literally any amount. Price 
has to be independent of your output for any amount you might consider 
selling. In the case of the cut-flower seller, the price has to be indepen¬ 
dent of how much she sells for any amount up to her stock on hand -the 
maximum that she could consider selling.) 

It is important to understand the difference between the “demand curve 
facing a firm” and the “market demand curve.” The market demand curve 
measures the relationship between the market price and the total amount 
of output sold. The demand curve facing a firm measures the relationship 
between the market price and the output of that particular firm. 

The market demand curve depends on consumers’ behavior. The demand 
curve facing a firm not only depends on consumers’ behavior but it also 
depends on the behavior of the other firms. The usual justification for the 
competitive model is that when there are many small firms in the market, 
each one faces a demand curve that is essentially flat. But even if there 
are only two firms in the market, and one insists on charging a fixed price 
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no matter what, then the other firm in the market will face a competitive 
demand curve like the one depicted in Figure 22.1. Thus the competitive 
model may hold in a wider variety of circumstances than is apparent at 
first glance. 



P 

Market demand 


Market * 

Demand curve 
facing firm 


price P 


Figure 

22.1 

y 

The demand curve facing a competitive firm. The firm’s 
demand is horizontal at the market price. At higher prices, the 
firm sells nothing, and below the market price it faces the entire 
market demand curve. 


22.3 The Supply Decision of a Competitive Firm 

Let us use the facts we have discovered about cost curves to figure out 
the supply curve of a competitive firm. By definition a competitive firm 
ignores its influence on the market price. Thus the maximization problem 
facing a competitive firm is 


ma xpy — c(y). 
v 

This just says that the competitive firm wants to maximize its profits: the 
difference between its revenue, py , and its costs, c(y). 

What level of output will a competitive firm choose to produce? Answer: 
it will operate where marginal revenue equals marginal cost—where the 
extra revenue gained by one more unit of output just equals the extra cost 
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of producing another unit. If this condition did not hold, the firm could 
always increase its profits by changing its level of output. 

In the case of a competitive firm, marginal revenue is simply the price. 
To see this, ask how much extra revenue a competitive firm gets when it 
increases its output by Ay. We have 

A R = pAy 

since by hypothesis p doesn’t change. Thus the extra revenue per unit of 
output is given by 

A R 



which is the expression for marginal revenue. 

Thus a competitive firm will choose a level of output y where the marginal 
cost that it faces at y is just equal to the market price. In symbols: 


P= MC{y). 


For a given market price, p : we want to find the level of output where 
profits are maximal. If price is greater than marginal cost at some level of 
output y , then the firm can increase its profits by producing a little more 
output. For price greater than marginal costs means 


P~ 


Ac 

Ay 


> 0 . 


So increasing output by Ay means that 


pAy 


Ac 

Ay 


Ay > 0. 


Simplifying we find that 

pAy — Ac > 0, 

which means that the increase in revenues from the extra output exceeds 
the increase in costs. Thus profits must increase. 

A similar argument can be made when price is less than marginal cost. 
Then reducing output will increase profits, since the lost revenues are more 
than compensated for by the reduced costs. 

So at the optimal level of output, a firm must be producing where price 
equals marginal costs. Whatever the level of the market price p , the firm 
will choose a level of output y where p = MC(y). Thus the marginal cost 
curve of a competitive firm is precisely its supply curve. Or put another 
way, the market price is precisely marginal cost—as long as each firm is 
producing at its profit-maximizing level. 
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Figure 

22.2 



Marginal cost and supply. Although there are two levels of 
output where price equals marginal cost, the profit-maximizing 
quantity supplied can lie only on the upward-sloping part of the 
marginal cost curve. 


22.4 An Exception 


Well ... maybe not precisely. There are two troublesome cases. The first 
case is when there are several levels of output where price equals marginal 
cost, such as the case depicted in Figure 22.2. Here there are two levels of 
output where price equals marginal cost. Which one will the firm choose? 

It is not hard to see the answer. Consider the first intersection, where 
the marginal cost curve is sloping down. Now if we increase output a little 
bit here, the costs of each additional unit of output will decrease. That’s 
what it means to say that the marginal cost curve is decreasing. But the 
market price will stay the same. Thus profits must definitely go up. 

So we can rule out levels of output where the marginal cost curve slopes 
downward. At those points an increase in output must always increase 
profits. The supply curve of a competitive firm must lie along the upward- 
sloping part of the marginal cost curve. This means that the supply curve 
itself must always be upward sloping. The “Giffen good” phenomenon 
cannot arise for supply curves. 

Price equals marginal cost is a necessary condition for profit maximiza¬ 
tion. It is not in general a sufficient condition. Just because we find a 
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point where price equals marginal cost doesn’t mean that we’ve found the 
maximum profit point. But if we find the maximum profit point, we know 
that price must equal marginal cost. 


22.5 Another Exception 

This discussion is assuming that it is profitable to produce something. 
After all it could be that the best thing for a firm to do is to produce zero 
output. Since it is always possible to produce a zero level of output, we 
have to compare our candidate for profit maximization with the choice of 
doing nothing at all. 

If a firm produces zero output it still has to pay its fixed costs, F. Thus 
the profits from producing zero units of output are just —F. The profits 
from producing a level of output y are py — c v (y) — F. The firm is better 
off going out of business when 

—F > py — c v (y) - F, 

that is, when the “profits” from producing nothing, and just paying the 
fixed costs, exceed the profits from producing where price equals marginal 
cost. Rearranging this equation gives us the shutdown condition: 

AVC(y) = > p , 

y 

If average variable costs are greater than p : the firm would be better off 
producing zero units of output. This makes good sense, since it says that 
the revenues from selling the output y don’t even cover the variable costs 
of production, c v {y). In this case the firm might as well go out of business. 
If it produces nothing it will lose its fixed costs, but it would lose even more 
if it continued to produce. 

This discussion indicates that only the portions of the marginal cost 
curve that lie above the average variable cost curve are possible points on 
the supply curve. If a point where price equals marginal cost is beneath 
the average variable cost curve, the firm would optimally choose to produce 
zero units of output. 

We now have a picture for the supply curve like that in Figure 22.3. The 
competitive firm produces along the part of the marginal cost curve that 
is upward sloping and lies above the average variable cost curve. 


EXAMPLE: Pricing Operating Systems 

A computer requires an operating system in order to run, and most hard¬ 
ware manufacturers sell their computers with the operating systems already 
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Figure 

22.3 



Average variable cost and supply. The supply curve is the 
upward-sloping part of the marginal cost curve that lies above 
the average variable cost curve. The firm will not operate on 
those points on the marginal cost curve below the average cost 
curve since it could have greater profits (less losses) by shutting 
down. 


installed. In the early 1980s several operating system producers were fight¬ 
ing for supremacy in the IBM-PC-compatible microcomputer market. The 
common practice at that time was for the producer of the operating system 
to charge the computer manufacturer for each copy of the operating system 
that was installed on a microcomputer that it sold. 

Microsoft Corporation offered an alternative plan in which the charge to 
the manufacturer was based on the number of microcomputers that were 
built by the manufacturer. Microsoft set their licensing fee low enough that 
this plan was attractive to the producers. 

Note the clever nature of Microsoft’s pricing strategy: once the contract 
with a manufacturer was signed, the marginal cost of installing MS-DOS 
on an already-built computer was zero. Installing a competing operating 
system, on the other hand, could cost $50 to $100. The hardware manu¬ 
facturer (and ultimately the user) paid Microsoft for the operating system, 
but the structure of the pricing contract made MS-DOS very attractive 
relative to the competition. As a result, Microsoft ended up being the de¬ 
fault operating system installed on microcomputers and achieved a market 
penetration of over 90 percent. 
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22.6 The Inverse Supply Function 

We have seen that the supply curve of a competitive firm is determined by 
the condition that price equals marginal cost. As before we can express 
this relation between price and output in two ways: we can either think 
of output as a function of price, as we usually do, or we can think of the 
“inverse supply function” that gives price as a function of output. There 
is a certain insight to be gained by looking at it in the latter way. Since 
price equals marginal cost at each point on the supply curve, the market 
price must be a measure of marginal cost for every firm operating in the 
industry. A firm that produces a lot of output and a firm that produces 
only a little output must have the same marginal cost, if they are both 
maximizing profits. The total cost of production of each firm can be very 
different, but the marginal cost of production must be the same. 

The equation p = MC(y) gives us the inverse supply function: price as 
a function of output. This way of expressing the supply curve can be very 
useful. 


22.7 Profits and Producer's Surplus 

Given the market price we can now compute the optimal operating posi¬ 
tion for the firm from the condition that p = MC(y ). Given the optimal 
operating position we can compute the profits of the firm. In Figure 22.4 
the area of the box is just p*y *, or total revenue. The area y*AC(y*) is 
total costs since 

yAC(y) = y — = c(y). 

y 

Profits are simply the difference between these two areas. 

Recall our discussion of producer’s surplus in Chapter 14. We defined 
producer’s surplus to be the area to the left of the supply curve, in analogy 
to consumer’s surplus, which was the area to the left of the demand curve. 
It turns out that producer’s surplus is closely related to the profits of a firm. 
More precisely, producer’s surplus is equal to revenues minus variable costs, 
or equivalently, profits plus the fixed costs: 

profits = py - c v (y) - F 

producer’s surplus = py — c v (y). 

The most direct way to measure producer’s surplus is to look at the 
difference between the revenue box and the box y*AVC(y*), as in Fig¬ 
ure 22.5A. But there are other ways to measure producer’s surplus by 
using the marginal cost curve itself. 
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Figure 

22.4 



Profits. Profits are the difference between total revenue and 
total costs, as shown by the colored rectangle. 


We know from Chapter 21 that the area under the marginal cost curve 
measures the total variable costs. This is true because the area under the 
marginal cost curve is the cost of producing the first unit plus the cost of 
producing the second unit, and so on. So to get producer’s surplus, we can 
subtract the area under the marginal cost curve from the revenue box and 
get the area depicted in Figure 22.5B. 

Finally, we can combine the two ways of measuring producer’s surplus. 
Use the “box” definition up to the point where marginal cost equals average 
variable cost, and then use the area above the marginal cost curve, as 
shown in Figure 22.5C. This latter way is the most convenient for most 
applications since it is just the area to the left of the supply curve. Note 
that this is consistent with definition of producer’s surplus given in Chapter 
14. 

We are seldom interested in the total amount of producer’s surplus; more 
often it is the change in producer’s surplus that is of interest. The change 
in producer’s surplus when the firm moves from output y* to output y' will 
generally be a trapezoidal shaped region like that depicted in Figure 22.6. 

Note that the change in producer’s surplus in moving from y* to y' is 
just the change in profits in moving from y* to y', since by definition the 
fixed costs don’t change. Thus we can measure the impact on profits of 
a change in output from the information contained in the marginal cost 
curve, without having to refer to the average cost curve at all. 
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Producer’s surplus. Three equivalent ways to measure pro¬ 
ducer’s surplus. Panel A depicts a box measuring revenue minus 
variable cost. Panel B depicts the area above the marginal cost 
curve. Panel C uses the box up until output z (area R) and 
then uses the area above the marginal cost curve (area T). 


Figure 

22.5 


EXAMPLE: The Supply Curve for a Specific Cost Function 

What does the supply curve look like for the example given in the last 
chapter where c(y) = y 2 + 1? In that example the marginal cost curve 
was always above the average variable cost curve, and it always sloped 
upward. So “price equals marginal costs” gives us the supply curve directly. 
Substituting 2 y for marginal cost we get the formula 

P = 2 y. 

This gives us the inverse supply curve, or price as a function of output. 
Solving for output as a function of price we have 

S(p) = y= | 

as our formula for the supply curve. This is depicted in Figure 22.7. 
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Figure 

22.6 


MC 

P 



s 

p' 



Supply curve 

Change ir 
producer 
surplus 

5 / 


P 




y* y> y 

The change in producer’s surplus. Since the supply curve 
coincides with the upward-sloping part of the marginal cost 
curve, the change in producer’s surplus will typically have a 
roughly trapezoidal shape. 


If we substitute this supply function into the definition of profits, we can 
calculate the maximum profits for each price p. Performing the calculation 
we have: 

7r(p) =py- c(y) 



How do the maximum profits relate to producer’s surplus? In Figure 22.7 
we see that producer’s surplus—the area to the left of the supply curve 
between a price of zero and a price of p —will be a triangle with a base of 
y = p/2 and a height of p. The area of this triangle is 



Comparing this with the profits expression, we see that producer’s surplus 
equals profits plus fixed costs, as claimed. 
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A specific example of a supply curve. The supply curve 
and producer’s surplus for the cost function c(y) = y 2 + 1. 


22.8 The Long-Run Supply Curve of a Firm 

The long-run supply function for the firm measures how much the firm 
would optimally produce when it is allowed to adjust plant size (or whatever 
factors are fixed in the short run). That is, the long-run supply curve will 
be given by 

p = MCi(y) = MC{y, k{y)). 

The short-run supply curve is given by price equals marginal cost at some 
fixed level of k: 

P= MC(y, k). 

Note the difference between the two expressions. The short-run supply 
curve involves the marginal cost of output holding k fixed at a given level 
of output, while the long-run supply curve involves the marginal cost of 
output when you adjust k optimally. 

Now, we know something about the relationship between short-run and 
long-run marginal costs: the short-run and the long-run marginal costs co¬ 
incide at the level of output y* where the fixed factor choice associated with 
the short-run marginal cost is the optimal choice, k*. Thus the short-run 
and the long-run supply curves of the firm coincide at y*, as in Figure 22.8. 

In the short run the firm has some factors in fixed supply; in the long 
run these factors are variable. Thus, when the price of output changes, the 


Figure 

22.7 
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The short-run and long-run supply curves. Typically the 
long-run supply curve will be more elastic than the short-run 


supply curve. 


firm has more choices to adjust in the long run than in the short run. This 
suggests that the long-run supply curve will be more responsive to price— 
more elastic—than the short-run supply curve, as illustrated in Figure 22.8. 

What else can we say about the long-run supply curve? The long run is 
defined to be that time period in which the firm is free to adjust all of its 
inputs. One choice that the firm has is the choice of whether to remain in 
business. Since in the long run the firm can always get zero profits by going 
out of business, the profits that the firm makes in long-run equilibrium have 
to be at least zero: 


PV ~ c{y) > 0, 


which means 



This says that in the long run price has to be at least as large as average 
cost. Thus the relevant part of the long-run supply curve is the upward- 
sloping part of the marginal cost curve that lies above the long-run average 
cost curve, as depicted in Figure 22.9. 

This is completely consistent with the short-run story. In the long run 
all costs are variable costs, so the short-run condition of having price above 
average variable cost is equivalent to the long-run condition of having price 
above average cost. 
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The long-run supply curve. The long-run supply curve will 
be the upward-sloping part of the long-run marginal cost curve 
that lies above the average cost curve. 


22.9 Long-Run Constant Average Costs 


One particular case of interest occurs when the long-run technology of the 
firm exhibits constant returns to scale. Here the long-run supply curve will 
be the long-run marginal cost curve, which, in the case of constant average 
cost, coincides with the long-run average cost curve. Thus we have the 
situation depicted in Figure 22.10, where the long-run supply curve is a 
horizontal line at c m i n , the level of constant average cost. 

This supply curve means that the firm is willing to supply any amount of 
output at p = c m i„, an arbitrarily large amount of output at p > c m i n , and 
zero output at p < c m i n . When we think about the replication argument 
for constant returns to scale this makes perfect sense. Constant returns 
to scale implies that if you can produce 1 unit for c m ; n dollars, you can 
produce n units for nc m ; n dollars. Therefore you will be willing to supply 
any amount of output at a price equal to c m ; n , and an arbitrarily large 
amount of output at any price greater than c m i n . 

On the other hand, if p < c m i n , so that you cannot break even supply¬ 
ing even one unit of output, you will certainly not be able to break even 
supplying n units of output. Hence, for any price less than c m i n , you will 
want to supply zero units of output. 


Figure 

22.9 
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Figure 

22.10 



Constant average costs. In the case of constant average 
costs, the long-run supply curve will be a horizontal line. 


Summary 

1. The relationship between the price a firm charges and the output that 
it sells is known as the demand curve facing the firm. By definition, a 
competitive firm faces a horizontal demand curve whose height is deter¬ 
mined by the market price—the price charged by the other firms in the 
market. 

2. The (short-run) supply curve of a competitive firm is that portion of its 
(short-run) marginal cost curve that is upward sloping and lies above the 
average variable cost curve. 

3. The change in producer’s surplus when the market price changes from 
Pi to P 2 is the area to the left of the marginal cost curve between pi and 
P 2 - It also measures the firm’s change in profits. 

4. The long-run supply curve of a firm is that portion of its long-run mar¬ 
ginal cost curve that is upward sloping and that lies above its long-run 
average cost curve. 
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REVIEW QUESTIONS 


1. A firm has a cost function given by c(y ) = 10y 2 + 1000. What is its 
supply curve? 

2. A firm has a cost function given by c(y) = 10y 2 + 1000. At what output 
is average cost minimized? 

3. If the supply curve is given by S(p) = 100 + 20 p, what is the formula for 
the inverse supply curve? 

4. A firm has a supply function given by S(p) = 4 p. Its fixed costs are 100. 
If the price changes from 10 to 20, what is the change in its profits? 

5. If the long-run cost function is c(y) = y 2 + 1, what is the long-run supply 
curve of the firm? 

6. Classify each of the following as either technological or market con¬ 
straints: the price of inputs, the number of other firms in the market, the 
quantity of output produced, and the ability to produce more given the 
current input levels. 

7. What is the major assumption that characterizes a purely competitive 
market? 

8. In a purely competitive market a firm’s marginal revenue is always equal 
to what? A profit-maximizing firm in such a market will operate at what 
level of output? 

9. If average variable costs exceed the market price, what level of output 
should the firm produce? What if there are no fixed costs? 

10. Is it ever better for a perfectly competitive firm to produce output even 
though it is losing money? If so, when? 

11. In a perfectly competitive market what is the relationship between the 
market price and the cost of production for all firms in the industry? 


APPENDIX 

The discussion in this chapter is very simple if you speak calculus. The profit- 
maximization problem is 

max py — c(y) 
y 


such that y > 0. 
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The necessary conditions for the optimal supply, y *, are the first-order condition 

P~c(y *) = 0 

and the second-order condition 

~c"(y*) < 0. 

The first-order condition says price equals marginal cost, and the second-order 
condition says that the marginal cost must be increasing. Of course this is pre¬ 
suming that y* > 0. If price is less than average variable cost at y *, it will pay 
the firm to produce a zero level of output. To determine the supply curve of a 
competitive firm, we must find all the points where the first- and second-order 
conditions are satisfied and compare them to each other—and to y = 0—and 
pick the one with the largest profits. That’s the profit-maximizing supply. 


CHAPTER 



INDUSTRY 

SUPPLY 


We have seen how to derive a firm’s supply curve from its marginal cost 
curve. But in a competitive market there will typically be many firms, so 
the supply curve the industry presents to the market will be the sum of the 
supplies of all the individual firms. In this chapter we will investigate the 

industry supply curve. 


23.1 Short-Run Industry Supply 

We begin by studying an industry with a fixed number of firms, n. We let 
Si(p) be the supply curve of firm i, so that the industry supply curve, 
or the market supply curve is 


n 

S(p) ='52Si(p): 

i=1 

which is the sum of the individual supply curves. Geometrically we take 
the sum of the quantities supplied by each firm at each price, which gives 
us a horizontal sum of supply curves, as in Figure 23.1. 
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Figure 

23.1 


S, 


Si + S2 


y 

The industry supply curve. The industry supply curve 
( 1 S 1 + S 2 ) is the sum of the individual supply curves (Si and 

5 a ). 


23.2 Industry Equilibrium in the Short Run 

In order to find the industry equilibrium we take this market supply curve 
and find the intersection with the market demand curve. This gives us an 
equilibrium price, p*. 

Given this equilibrium price, we can go back to look at the individual 
firms and examine their output levels and profits. A typical configuration 
with three firms, A, B, and C, is illustrated in Figure 23.2. In this example, 
firm A is operating at a price and output combination that lies on its 
average cost curve. This means that 


P = A>. 

y 

Cross multiplying and rearranging, we have 

PV ~ c{y) = 0. 

Thus firm A is making zero profits. 

Firm B is operating at a point where price is greater than average cost: 
p > c(y)/y , which means it is making a profit in this short-run equilibrium. 
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Short-run equilibrium. An example of a short-run equilib¬ 
rium with three firms. Firm A is making zero profits, firm B is 
making positive profits, and firm C is making negative profits, 
that is, making a loss. 


Figure 

23.2 


Firm C is operating where price is less than average cost, so it is making 
negative profits, that is, making a loss. 

In general, combinations of price and output that lie above the average 
cost curve represent positive profits, and combinations that lie below rep¬ 
resent negative profits. Even if a firm is making negative profits, it will still 
be better for it to stay in business in the short run if the price and output 
combination lie above the average variable cost curve. For in this case, it 
will make less of a loss by remaining in business than by producing a zero 
level of output. 


23.3 Industry Equilibrium in the Long Run 


In the long run, firms are able to adjust their fixed factors. They can 
choose the plant size, or the capital equipment, or whatever to maximize 
their long-run profits. This just means that they will move from their 
short-run to their long-run cost curves, and this adds no new analytical 
difficulties: we simply use the long-run supply curves as determined by the 
long-run marginal cost curve. 

However, there is an additional long-run effect that may occur. If a firm 
is making losses in the long run, there is no reason to stay in the industry, so 
we would expect to see such a firm exit the industry, since by exiting from 
the industry, the firm could reduce its losses to zero. This is just another 
way of saying that the only relevant part of a firm’s supply curve in the 
long run is that part that lies on or above the average cost curve—since 
these are locations that correspond to nonnegative profits. 

















416 INDUSTRY SUPPLY (Ch. 23) 


Similarly, if a firm is making profits we would expect entry to occur. Af¬ 
ter all, the cost curve is supposed to include the cost of all factors necessary 
to produce output, measured at their market price (i.e., their opportunity 
cost). If a firm is making profits in the long run it means that anybody 
can go to market, acquire those factors, and produce the same amount of 
output at the same cost. 

In most competitive industries there are no restrictions against new firms 
entering the industry; in this case we say the industry exhibits free entry. 
However, in some industries there arc barriers to entry, such as licenses 
or legal restrictions on how many firms can be in the industry. For example, 
regulations on the sales of alcohol in many states prevent free entry to the 
retail liquor industry. 

The two long-run effects—acquiring different fixed factors and the entry 
and exit phenomena—are closely related. An existing firm in an indus¬ 
try can decide to acquire a new plant or store and produce more output. 
Or a new firm may enter the industry by acquiring a new plant and pro¬ 
ducing output. The only difference is in who owns the new production 
facilities. 

Of course as more firms enter the industry—and firms that are losing 
money exit the industry—the total amount produced will change and lead 
to a change in the market price. This in turn will affect profits and the 
incentives to exit and enter. What will the final equilibrium look like in an 
industry with free entry? 

Let’s examine a case where all firms have identical long-run cost func¬ 
tions, say, c{y). Given the cost function we can compute the level of out¬ 
put where average costs are minimized, which we denote by y*. We let 
p* = c(y*)/y* be the minimum value of average cost. This cost is signifi¬ 
cant because it is the lowest price that could be charged in the market and 
still allow firms to break even. 

We can now graph the industry supply curves for each different number 
of firms that can be in the market. Figure 23.3 illustrates the industry 
supply curves if there are 1,... ,4 firms in the market. (We are using 4 
firms only for purposes of an example; in reality, one would expect there 
to be many more firms in a competitive industry.) Note that since all 
firms have the same supply curve, the total amount supplied if 2 firms 
are in the market is just twice as much as when 1 firm is the market, the 
supply when 3 firms are in the market is just three times as much, and so 
on. 

Now add two more lines to the diagram: a horizontal line at p*, the min¬ 
imum price consistent with nonnegative profits, and the market demand 
curve. Consider the intersections of the demand curve and the supply 
curves for n = 1,2,... firms. If firms enter the industry when positive 
profits are being made, then the relevant intersection is the lowest price 
consistent with nonnegative profits. This is denoted by p' in Figure 23.3, 
and it happens to occur when there are three firms in the market. If one 
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Industry supply curves with free entry. Supply curves for 
1,... ,4 firms. The equilibrium price, p ', occurs at the lowest 
possible intersection of demand and supply such that p' > p*. 


more firm enters the market, profits are pushed to be negative. In this 
case, the maximum number of competitive firms this industry can support 
is three. 


23.4 The Long-Run Supply Curve 

The construction given in the last section—draw the industry supply curves 
for each possible number of firms that could be in the market and then look 
for the largest number of firms consistent with nonnegative profits—is per¬ 
fectly rigorous and easy to apply. However, there is a useful approximation 
that usually gives something very close to the right answer. 

Let’s see if there is some way to construct one industry supply curve out 
of the n curves we have above. The first thing to note is that we can rule 
out all of the points on the supply curve that are below p *, since those can 
never be long-run operating positions. But we can also rule out some of 
the points on the supply curves above p*. 

We typically assume that the market demand curve is downward slop¬ 
ing. The steepest possible demand curve is therefore a vertical line. This 
implies that points like A in Figure 23.3 would never be observed—for any 
downward-sloping demand curve that passed through A would also have 
to intersect a supply curve associated with a larger number of firms, as 


Figure 

23.3 
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Figure 

23.4 


shown by the hypothetical demand curve D" passing through the point A 
in Figure 23.3. 

Thus we can eliminate a portion of each supply curve from being a possi¬ 
ble long-run equilibrium position. Every point on the one-firm supply curve 
that lies to the right of the intersection of the two-firm supply curve and 
the line determined by p* cannot be consistent with long-run equilibrium. 
Similarly, every point on the two-firm supply curve that lies to the right of 
the intersection of the three-firm supply curve with the p* line cannot be 
consistent with long-run equilibrium ... and every point on the n-firm sup¬ 
ply curve that lies to the right of the intersection of the n + 1-firm supply 
curve with the p* line cannot be consistent with equilibrium. 

The parts of the supply curves on which the long-run equilibrium can 
actually occur are indicated by the black line segments in Figure 23.4. The 
n th black line segment shows all the combinations of prices and industry 
output that are consistent with having n firms in long-run equilibrium. 
Note that these line segments get flatter and flatter as we consider larger 
and larger levels of industry output, involving more and more firms in the 
industry. 



The long-run supply curve. We can eliminate portions 
of the supply curves that can never be intersections with a 
downward-sloping market demand curve in the long run, such 
as the points on each supply curve to the right of the dotted 
lines. 
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Why do these curves get flatter? Think about it. If there is one firm 
in the market and the price goes up by A p, it will produce, say, Ay more 
output. If there are n firms in the market and the price goes up by A p, 
each firm will produce Ay more output, so we will get nAy more output in 
total. This means that the supply curve will be getting flatter and flatter 
as there are more and more firms in the market, since the supply of output 
will be more and more sensitive to price. 

By the time we get a reasonable number of firms in the market, the 
slope of the supply curve will be very flat indeed. Flat enough so that it is 
reasonable to take it as having a slope of zero—that is, as taking the long- 
run industry supply curve to be a flat line at price equals minimum average 
cost. This will be a poor approximation if there are only a few firms in the 
industry in the long run. But the assumption that a small number of firms 
behave competitively will also probably be a poor approximation! If there 
are a reasonable number of firms in the long run, the equilibrium price 
cannot get far from minimum average cost. This is depicted in Figure 23.5. 



Approximate long-run supply curve. The long-run sup¬ 
ply curve will be approximately flat at price equals minimum 
average cost. 


Figure 

23.5 


This result has the important implication that in a competitive industry 
with free entry, profits cannot get very far from zero. If there are significant 
levels of profits in an industry with free entry, it will induce other firms to 
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enter that industry and thereby push profits toward zero. 

Remember, the correct calculation of economic costs involves measuring 
all factors of production at their market prices. As long as all factors are 
being measured and properly priced, a firm earning positive profits can 
be exactly duplicated by anyone. Anyone can go to the open market and 
purchase the factors of production necessary to produce the same output 
in the same way as the firm in question. 

In an industry with free entry and exit, the long-run average cost curve 
should be essentially flat at a price equal to the minimum average cost. This 
is just the kind of long-run supply curve that a single firm with constant 
returns to scale would have. This is no accident. We argued that constant 
returns to scale was a reasonable assumption since a firm could always 
replicate what it was doing before. But another firm could replicate it as 
well! Expanding output by building a duplicate plant is just like a new 
firm entering the market with duplicate production facilities. Thus the 
long-run supply curve of a competitive industry with free entry will look 
like the long-run supply curve of a firm with constant returns to scale: a 
flat line at price equals minimum average cost. 


EXAMPLE: Taxation in the Long Run and in the Short Run 

Consider an industry that has free entry and exit. Suppose that initially it 
is in a long-run equilibrium with a fixed number of firms, and zero profits, 
as depicted in Figure 23.6. In the short run, with a fixed number of firms, 
the supply curve of the industry is upward sloping, while in the long run, 
with a variable number of firms, the supply curve is flat at price equals 
minimum average cost. 

What happens when we put a tax on this industry? We use the geometric 
analysis discussed in Chapter 16: in order to find the new price paid by 
the demanders, we shift the supply curve up by the amount of the tax. 

In general, the consumers will face a higher price and the producers will 
receive a lower price after the tax is imposed. But the producers were just 
breaking even before the tax was imposed; thus they must be losing money 
at any lower price. These economic losses will encourage some firms to 
leave the industry. Thus the supply of output will be reduced, and the 
price to the consumers will rise even further. 

In the long run, the industry will supply along the horizontal long-run 
supply curve. In order to supply along this curve, the firms will have to 
receive a price equal to the minimum average cost—just what they were 
receiving before the tax was imposed. Thus the price to the consumers will 
have to rise by the entire amount of the tax. 

In Figure 23.6, the equilibrium is initially at Pjj = Pg. Then the tax 
is imposed, shifting the short-run supply curve up by the amount of the 
tax, and the equilibrium price paid by the demanders increases to P' D . The 
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Taxation in the short run and long run. In the short run, 
with a fixed number of firms, the industry supply curve will have 
an upward slope, so that part of the tax falls on the consumers 
and part on the firms. In the long run, the industry supply 
curve will be horizontal so all of the tax falls on the consumers. 


equilibrium price received by the suppliers falls to P' s = P' D — t. But this 
is only in the short run—when there are a fixed number of firms in the 
industry. Because of free entry and exit, the long-run supply curve in the 
industry is horizontal at Pd = Ps = minimum average cost. Hence, in the 
long run, shifting up the supply curve implies that the entire amount of 
the tax gets passed along to the consumers. 

To sum up: in an industry with free entry, a tax will initially raise the 
price to the consumers by less than the amount of the tax, since some of 
the incidence of the tax will fall on the producers. But in the long run the 
tax will induce firms to exit from the industry, thereby reducing supply, so 
that consumers will eventually end up paying the entire burden of the tax. 


23.5 The Meaning of Zero Profits 


In an industry with free entry, profits will be driven to zero by new entrants: 
whenever profits are positive, there will be an incentive for a new firm to 
come in to acquire some of those profits. When profits are zero it doesn’t 


Figure 

23.6 
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mean that the industry disappears; it just means that it stops growing, 
since there is no longer an inducement to enter. 

In a long-run equilibrium with zero profits, all of the factors of production 
are being paid their market price—the same market price that these factors 
could earn elsewhere. The owner of the firm, for example, is still collecting 
a payment for her labor time, or for the amount of money she invested 
in the firm, or for whatever she contributes to the operation of the firm. 
The same goes for all other factors of production. The firm is still making 
money—it is just that all the money that it makes is being paid out to 
purchase the inputs that it uses. Each factor of production is earning the 
same amount in this industry that it could earn elsewhere, so there are 
no extra rewards—no pure profits—to attract new factors of production to 
this industry. But there is nothing to cause them to leave either. Industries 
in long-run equilibrium with zero profits are mature industries; they’re not 
likely to appear as the cover story in Business Week, but they form the 
backbone of the economy. 

Remember, economic profits are defined using the market prices of all 
factors of production. The market prices measure the opportunity cost 
of those factors—what they could earn elsewhere. Any amount of money 
earned in excess of the payments to the factors of production is a pure 
economic profit. But whenever someone finds a pure economic profit, other 
people will try to enter the industry and acquire some of that profit for 
themselves. It is this attempt to capture economic profits that eventually 
drives them to zero in a competitive industry with free entry. 

In some quarters, the profit motive is regarded with some disdain. But 
when you think about it purely on economic grounds, profits are providing 
exactly the right signals as far as resource allocation is concerned. If a firm 
is making positive profits, it means that people value the output of the 
firm more highly than they value the inputs. Doesn’t it make sense to have 
more firms producing that kind of output? 


23.6 Fixed Factors and Economic Rent 

If there is free entry, profits are driven to zero in the long run. But not 
every industry has free entry. In some industries the number of firms in 
the industry is fixed. 

A common reason for this is that there are some factors of production 
that are available in fixed supply. We said that in the long run the fixed 
factors could be bought or sold by an individual firm. But there are some 
factors that are fixed for the economy as a whole even in the long run. 

The most obvious example of this is in resource-extraction industries: 
oil in the ground is a necessary input to the oil-extraction industry, and 
there is only so much oil around to be extracted. A similar statement 
could be made for coal, gas, precious metals, or any other such resource. 
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Agriculture gives another example. There is only a certain amount of land 
that is suitable for agriculture. 

A more exotic example of such a fixed factor is talent. There are only 
a certain number of people who possess the necessary level of talent to be 
professional athletes or entertainers. There may be “free entry” into such 
fields—but only for those who are good enough to get in! 

There are other cases where the fixed factor is fixed not by nature, but 
by law. In many industries it is necessary to have a license or permit, and 
the number of these permits may be fixed by law. The taxicab industry in 
many cities is regulated in this way. Liquor licenses are another example. 

If there are restrictions such as the above on the number of firms in the 
industry, so that firms cannot enter the industry freely, it may appear that 
it is possible to have an industry with positive profits in the long run, with 
no economic forces to drive those profits to zero. 

This appearance is wrong. There is an economic force that pushes profits 
to zero. If a firm is operating at a point where its profits appear to be 
positive in the long run, it is probably because we are not appropriately 
measuring the market value of whatever it is that is preventing entry. 

Here it is important to remember the economic definition of costs: we 
should value each factor of production at its market price —its opportunity 
cost. If it appears that a farmer is making positive profits after we have 
subtracted his costs of production, it is probably because we have forgotten 
to subtract the cost of his land. 

Suppose that we manage to value all of the inputs to farming except for 
the land cost, and we end up with 7r dollars per year for profits. How much 
would the land be worth on a free market? How much would someone pay 
to rent that land for a year? 

The answer is: they would be willing to rent it for 7r dollars per year, 
the “profits” that it brings in. You wouldn’t even have to know anything 
about farming to rent this land and earn n dollars—after all, we valued 
the farmer’s labor at its market price as well, and that means that you can 
hire a farmer and still make n dollars of profit. So the market value of that 
land—its competitive rent—is just ir. The economic profits to farming are 
zero. 

Note that the rental rate determined by this procedure may have nothing 
whatsoever to do with the historical cost of the farm. What matters is not 
what you bought it for, but what you can sell it for -that’s what determines 
opportunity cost. 

Whenever there is some fixed factor that is preventing entry into an 
industry, there will be an equilibrium rental rate for that factor. Even with 
fixed factors, you can always enter an industry by buying out the position 
of a firm that is currently in the industry. Every firm in the industry has 
the option of selling out—and the opportunity cost of not doing so is a cost 
of production that it has to consider. 

Thus in one sense it is always the possibility of entry that drives profits to 
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zero. After all, there are two ways to enter an industry: you can form a new 
firm, or you can buy out an existing firm that is currently in the industry. 
If a new firm can buy everything necessary to produce in an industry and 
still make a profit, it will do so. But if there are some factors that are in 
fixed supply, then competition for those factors among potential entrants 
will bid the prices of these factors up to a point where the profit disappears. 


EXAMPLE: Taxi Licenses in New York City 

Earlier we said that licenses to operate New York City taxicabs sell for 
about $100,000. Yet in 1986 taxicab drivers made only about $400 for a 
50-hour week; this translated into less than an $8 hourly wage. The New 
York Taxi and Limosine Commission argued that this wage was too low 
to attract skilled drivers and that taxi fares should be raised in order to 
attract better drivers. 

An economist would argue that allowing the fares to increase would have 
virtually no effect on the take-home pay of the drivers; all that would 
happen is that the value of the taxicab license would increase. We can see 
why by examining the commission’s figures for the costs of operating a taxi. 
In 1986, the lease rate was $55 for a day shift and $65 for a night shift. 
The driver who leased the taxi paid for the gasoline and netted about $80 
a day in income. 

But note how much the owner of the taxicab license made. Assuming 
that the cab could be rented for two shifts for 320 days a year, the lease 
income comes to $38,400. Insurance, depreciation, maintenance, and so on 
amounted to about $21,100 a year; this leaves a net profit of $17,300 per 
year. Since the license cost about $100,000, this indicates a total return of 
about 17 percent. 

An increase in the rate that taxis were allowed to charge would be re¬ 
flected directly in the value of the license. A fare increase that brought 
in an extra $10,000 a year would result in a license’s value increasing by 
about $60,000. The wage rate for the cab drivers—which is set in the labor 
market—would not be affected by such a change. 1 


23.7 Economic Rent 

The examples in the last section are instances of economic rent. Eco¬ 
nomic rent is defined as those payments to a factor of production that are 
in excess of the minimum payment necessary to have that factor supplied. 


i 


Figures are taken from an unsigned editorial in the New York Times, August 17, 
1986. 
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Consider, for example, the case of oil discussed earlier. In order to pro¬ 
duce oil you need some labor, some machinery, and, most importantly, 
some oil in the ground! Suppose that it costs $1 a barrel to pump oil out 
of the ground from an existing well. Then any price in excess of $1 a barrel 
will induce firms to supply oil from existing wells. But the actual price of 
oil is much higher than $1 a barrel. People want oil for various reasons, 
and they are willing to pay more than its cost of production to get it. The 
excess of the price of oil over its cost of production is economic rent. 

Why don’t firms enter this industry? Well, they try. But there is only 
a certain amount of oil available. Oil will sell for more than its cost of 
production because of the limited supply. 

Now consider taxicab licenses. Viewed as pieces of paper, these cost 
almost nothing to produce. But in New York City a taxicab license can 
sell for $100,000! Why don’t people enter this industry and produce more 
taxicab licenses? The reason is that entry is illegal—the supply of taxicab 
licenses is controlled by the city. 

Farmland is yet another example of economic rent. In the aggregate, the 
total amount of land is fixed. There would be just as much land supplied 
at zero dollars an acre as at $1000 an acre. Thus in the aggregate, the 
payments to land constitute economic rent. 

From the viewpoint of the economy as a whole, it is the price of agri¬ 
cultural products that determines the value of agricultural land. But from 
the viewpoint of the individual farmer, the value of his land is a cost of 
production that enters into the pricing of his product. 

This is depicted in Figure 23.7. Here AVC represents the average cost 
curve for all factors of production excluding land costs. (We are assuming 
that land is the only fixed factor.) If the price of the crop grown on this 
land is p*, then the “profits” attributable to the land are measured by 
the area of the box: these are the economic rents. This is how much the 
land would rent for in a competitive market—whatever it took to drive the 
profits to zero. 

The average cost curve including the value of the land is labeled AC. If 
we measure the value of the land correctly, the economic profits to operating 
the farm will be exactly zero. Since the equilibrium rent for the land will 
be whatever it takes to drive profits to zero, we have 

P*U* ~ c v (y*) - rent = 0 


or 

rent = p*y* - c v (y*). (23.1) 

This is precisely what we referred to as producer’s surplus earlier. Indeed, 
it is the same concept, simply viewed in a different light. Thus we can also 
measure rent by taking the area to the left of the marginal cost curve, as 
we saw earlier. 
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Figure 

23.7 



Economic rent for land. The area of the box represents the 
economic rent on the land. 


Given the definition of rent in equation (23.1), it is now easy to see the 
truth of what we said earlier: it is the equilibrium price that determines 
rent, not the reverse. The firm supplies along its marginal cost curve— 
which is independent of the expenditures on the fixed factors. The rent 
will adjust to drive profits to zero. 


23.8 Rental Rates and Prices 

Since we are measuring output in flow units—so much output per unit of 
time, we should be careful to measure profits and rents in dollars per unit 
of time. Thus in the above discussion we talked about the rent per year 
for land or for a taxicab license. 

If the land or the license is to be sold outright rather than rented, the 
equilibrium price would be the present value of the stream of rental pay¬ 
ments. This is a simple consequence of the usual argument that assets 
generating a stream of payments should sell for their present values in a 
competitive market. 

EXAMPLE: Liquor Licenses 

In the United States, each state sets its own policy with respect to sales of 
alcohol. Some states have a liquor monopoly; other states issue licenses to 
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those who wish to sell alcohol. In some cases, licenses are issued on payment 
of a fee; in other cases, the number of licenses is fixed. In Michigan, for 
example, the number of licenses for sales of beer and wine for consumption 
on premises is limited to one for every 1,500 residents. 

After each Federal census, a state liquor control board allocates licenses 
to communities whose populations have grown. (Licenses are not taken 
away from communities whose populations have fallen, however.) This 
artificial scarcity of licenses has created a vibrant market for licenses to 
serve liquor in many fast-growing communities. For example, in 1983 Ann 
Arbor, Michigan, had sixty-six existing liquor licenses. Six new licenses 
were allowed to be issued as a result of the 1980 census, and 33 applicants 
lined up to lobby for these licenses. At the time, the market value of a liquor 
license was about $80,000. The local newspaper ran a story asserting that 
“demand exceeds supply for liquor licenses.” It was hardly surprising to the 
local economists that giving away an $80,000 asset for a zero price resulted 
in excess demand! 

There have been many proposals to relax the liquor control laws in Michi¬ 
gan by allowing the state to issue new licenses. However, these proposals 
have never been enacted into law due to the opposition of various political 
groups. Some of these groups are opposed to the consumption of alcohol 
on grounds of public health or religion. Others have somewhat different 
motives. For example, one of the most vociferous opponents of relaxed 
liquor laws is the Michigan Licensed Beverage Association, a group that 
represents the sellers of alcoholic beverages in Michigan. Though at first 
glance it appears paradoxical that this group would oppose liberalization 
of the liquor laws, a little reflection indicates a possible reason: issuing 
more liquor licenses would undoubtably lower the resale value of existing 
licenses—imposing significant capital losses on current holders of such li¬ 
censes. 

23.9 The Politics of Rent 

Often economic rent exists because of legal restrictions on entry into the 
industry. We mentioned two examples above: taxicab licenses and liquor 
licenses. In each of these cases the number of licenses is fixed by law, thus 
restricting entry to the industry and creating economic rents. 

Suppose that the New York City government wants to increase the num¬ 
ber of operating taxicabs. What will happen to the market value of the 
existing taxicab licenses? Obviously they will fall in value. This reduction 
in value hits the industry right in the pocketbook, and it is sure to create 
a lobbying force to oppose any such move. 

The federal government also artificially restricts output of some products 
in such a way as to create a rent. For example, the federal government has 
declared that tobacco can only be grown on certain lands. The value of 
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this land is then determined by the demand for tobacco products. Any 
attempt to eliminate this licensing system has to contend with a serious 
lobby. Once the government creates artificial scarcity, it is very hard to 
eliminate it. The beneficiaries of the artificial scarcity—the people who 
have acquired the right to operate in the industry—will vigorously oppose 
any attempts to enlarge the industry. 

The incumbents in an industry in which entry is legally restricted may 
well devote considerable resources to maintaining their favored position. 
Lobbying expenses, lawyers’ fees, public relations costs, and so on can be 
substantial. From the viewpoint of society these kinds of expenses represent 
pure social waste. They aren’t true costs of production; they don’t lead 
to any more output being produced. Lobbying and public relations efforts 
just determine who gets the money associated with existing output. 

Efforts directed at keeping or acquiring claims to factors in fixed supplies 
are sometimes referred to as rent seeking. From the viewpoint of society 
they represent a pure deadweight loss since they don’t create any more 
output, they just change the market value of existing factors of production. 


EXAMPLE: Farming the Government 

There is only one good thing to say about the U.S. program of farm sub¬ 
sidies: it produces a never-ending source of examples for economics text¬ 
books. Every new reform of the farm program brings new problems. “If 
you want to find the holes in a program, just toss them out to farmers. No 
one is more innovative in finding ways to use them,” says Terry Bar, the 
vice president of the National Council of Farm Cooperatives. 2 

Up until 1996 the basic structure of farm subsidies in the U.S. involved 
price supports: the Federal government guaranteed a support price for a 
crop and would make up the difference if the price fell below the support 
price. In order to qualify for this program, a farmer had to agree not to 
farm a certain fraction of his land. 

By the very nature of this plan, most of the benefits accrued to the large 
farmers. According to one calculation, 13 percent of the direct Federal 
subsidies were going to the 1 percent of the farmers who had sales over 
$500,000 a year. The Food Security Act of 1985 significantly restricted the 
payments to large farmers. As a result, the farmers broke up their holdings 
by leasing the land to local investors. The investors would acquire parcels 
large enough to take advantage of the subsidies, but too small to run into 
the restrictions aimed at large farmers. Once the land was acquired the 
investor would register it with a government program that would pay the 


2 


Quoted in William Robbins, “Limits on Subsidies to Big Farms Go Awry, Sending 
Costs Climbing,” New York Times, June 15, 1987, Al. 
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investor not to plant the land. This practice became known as “farming 
the government.” 

According to one study, the restriction on payments to the large farmers 
in the 1985 farm act resulted in the creation of 31,000 new applicants for 
farm subsidies. The cost of these subsidies was in the neighborhood of $2.3 
billion. 

Note that the ostensible goal of the program- restricting the amount of 
government subsidies paid to large farmers—has not been achieved. When 
the large farmers rent their land to small farmers, the market price of the 
rents depends on the generosity of the Federal subsidies. The higher the 
subsidies, the higher the equilibrium rent the large farmers receive. The 
benefits from the subsidy program still falls on those who initially own 
the land, since it is ultimately the value of what the land can earn—either 
from growing crops or farming the government -that determines its market 
value. 

The Farm Act of 1996 promised a phaseout of most agricultural subsidies 
by 2002. However, the 1998 federal budget restored over 6 billion dollars 
of federal farm subsidies, illustrating once again how hard it is to reconcile 
politics and economics. 


23.10 Energy Policy 

We end this chapter with an extended example that uses some of the con¬ 
cepts we have developed. 

In 1974 the Organization of Petroleum Exporting Countries (OPEC) 
levied a significant increase in the price of oil. Countries that had no 
domestically produced petroleum had little choice about energy policy— 
the price of oil and goods produced using oil had to rise. 

At that time the United States produced about half of its domestic oil 
consumption, and Congress felt that it was unfair that the domestic pro¬ 
ducers should receive “windfall profits” from an uncontrolled increase in 
price. (The term windfall profits refers to an increase in profits due to 
some outside event, as opposed to an increase in profits due to production 
decisions.) Consequently, Congress devised a bizarre plan to attempt to 
hold down the price of products that used oil. The most prominent of 
these products is gasoline, so we will analyze the effect of the program for 
that market. 


Two-Tiered Oil Pricing 

The policy adopted by Congress was known as “two-tiered” oil pricing, 
and it went something like this. Imported oil would sell for whatever its 
market price was, but domestic oil—oil produced from wells that were in 
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place before 1974—would sell for its old price: the price that it sold for 
before OPEC. Roughly speaking, we’ll say that imported oil sold for about 
$15 a barrel, while domestic oil sold for around $5. The idea was that the 
average price of oil would then be about $10 a barrel and this would help 
hold down the price of gasoline. 

Could such a scheme work? Let’s think about it from the viewpoint of 
the gasoline producers. What would the supply curve of gasoline look like? 
In order to answer this question we have to ask what the marginal cost 
curve for gasoline looked like. 

What would you do if you were a gasoline refiner? Obviously you would 
try to use the cheap domestic oil first. Only after you had exhausted your 
supplies of domestic oil would you turn to the more expensive imported oil. 
Thus the aggregate marginal cost curve—the industry supply curve—for 
gasoline would have to look something like that depicted in Figure 23.8. 
The curve takes a jump at the point where the U.S. production of domestic 
oil is exhausted and the imported oil begins to be used. Before that point, 
the domestic price of oil measures the relevant factor price for producing 
gasoline. After that point, it is the price of foreign oil that is the relevant 
factor price. 

Figure 23.8 depicts the supply curve for gasoline if all oil were to sell for 
the world price of $15 a barrel, and if all oil were to sell for the domestic 
price of $5 a barrel. If domestic oil actually sells for $5 a barrel and foreign 
oil sells for $15 a barrel, then the supply curve for gasoline will coincide 
with the $5-a-barrel supply curve until the cheaper domestic oil is used up, 
and then coincide with the $15-a-barrel supply curve. 

Now let’s find the intersection of this supply curve with the market de¬ 
mand curve to find the equilibrium price in Figure 23.8. The diagram 
reveals an interesting fact: the price of gasoline is exactly the same in the 
two-tiered system as it would be if all oil sold at the price of foreign oil! 
The price of gasoline is determined by the marginal cost of production, and 
the marginal cost is determined by the cost of the imported oil. 

If you think about it a minute, this makes perfectly good sense. The 
gasoline companies will sell their product at the price the market will bear. 
Just because you were lucky enough to get some cheap oil doesn’t mean 
you won’t sell your gasoline for the same price that other firms are selling 
theirs for. 

Suppose for the moment that all oil did sell for one price, and that 
equilibrium was reached at the price p *. Then the government comes along 
and lowers the price of the first 100 barrels of oil that each refiner used. 
Will this affect their supply decision? No way—in order to affect supply 
you have to change the incentives at the margin. The only way to get a 
lower price of gasoline is to increase the supply, which means that you have 
to make the marginal cost of oil cheaper. 

The two-tiered oil pricing policy was simply a transfer from the domestic 
oil producers to the domestic oil refiners. The domestic producers got $10 
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The supply curve for gasoline. Under the two-tiered oil 
pricing policy, the supply curve of gasoline would be discontin¬ 
uous, jumping from the lower supply curve to the upper supply 
curve when the cheaper oil was exhausted. 


less for their oil than they would have otherwise, and the profits they would 
have gotten went to the gasoline refiners. It had no effect on the supply of 
gasoline, and thus it could have no effect on the price of gasoline. 


Price Controls 

The economic forces inherent in this argument didn’t take long to make 
themselves felt. The Department of Energy soon realized that it couldn’t 
allow market forces to determine the price of gasoline under the two-tiered 
system—since market forces alone would imply one price of gasoline, which 
would be the same price that would prevail in the absence of the two-tiered 
system. 

So they instituted price controls on gasoline. Each refiner was required 
to charge a price for gasoline that was based on the costs of producing the 
gasoline—which in turn was primarily determined by the cost of the oil 
that the refiner was able to purchase. 

The availability of cheap domestic oil varied with location. In Texas the 
refiners were close to the major source of production and thus were able to 


Figure 
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purchase large supplies of cheap oil. Due to the price controls, the price of 
Texas gasoline was relatively cheap. In New England, virtually all oil had 
to be imported, and thus the price of gasoline in New England was quite 
high. 

When you have different prices for the same product, it is natural for 
firms to try to sell at the higher price. Again, the Department of Energy 
had to intervene to prevent the uncontrolled shipping of gasoline from low- 
price regions to high-price regions. The result of this intervention was the 
famous gasoline shortages of the mid-seventies. Periodically, the supply of 
gasoline in a region of the country would dry up, and there would be little 
available at any price. The free market system of supplying petroleum 
products had never exhibited such behavior; the shortages were entirely 
due to the two-tiered oil pricing system coupled with price controls. 

Economists pointed this out at the time, but it didn’t have much effect on 
policy. What did have an effect was lobbying by the gasoline refiners. Much 
of the domestic oil was sold on long-term contracts, and some refiners were 
able to buy a lot of it, while others could only buy the expensive foreign 
oil. Naturally they objected that this was unfair, so Congress figured out 
another scheme to allocate the cheap domestic oil more equitably. 


The Entitlement Program 

This program was known as the “entitlement program,” and it went some¬ 
thing like this. Each time a refiner bought a barrel of expensive foreign 
oil he got a coupon that allowed him to buy a certain amount of cheap 
domestic oil. The amount that the refiner was allowed to buy depended 
on supply conditions, but let’s say that it was one for one: each barrel of 
foreign oil that he bought for $15 allowed him to buy one barrel of domestic 
oil for $5. 

What did this do to the marginal price of oil? Now the marginal price of 
oil was just a weighted average of the domestic price and the foreign price 
of oil; in the one-for-one case described above, the price would be $10. The 
effect on the supply curve of gasoline is depicted in Figure 23.9. 

The marginal cost of oil was reduced all right, and that meant that 
the price of gasoline was reduced as well. But look who is paying for 
it: the domestic oil producers! The United States was buying foreign oil 
that cost $15 a barrel in real dollars and pretending that it only cost $10. 
The domestic oil producers were required to sell their oil for less than the 
market price on the world oil market. We were subsidizing the importation 
of foreign oil and forcing the domestic oil producers to pay the subsidy! 

Eventually this program was abandoned as well, and the U.S. imposed 
a tax on the domestic production of oil so that the U.S. oil producers 
wouldn’t reap windfall profits due to OPEC’s action. Of course, such a tax 
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The entitlement program. Under the entitlement program 
the supply curve of gasoline would lie between the supply curve 
if all oil were provided at the imported price and the supply 
curve if all oil were provided at the domestic price. 


discouraged production of domestic oil, and thereby increases the price of 
gasoline, but this was apparently acceptable to Congress at the time. 


23.11 Carbon Tax Versus Cap and Trade 

Motivated by concerns about global warming, several climatologists have 
urged governments to institute policies to reduce carbon emissions. Two of 
these reduction policies are particularly interesting from an economic point 
of view: carbon taxes and cap and trade. 

A carbon tax imposes a tax on carbon emissions, while a cap and trade 
system grants licenses to emit carbon that can be traded on an organized 
market. To see how these systems compare, let us examine a simple model. 


Optimal Production of Emissions 

We begin by examining the problem of producing a target amount of emis¬ 
sions in the least costly way. Suppose that there are two firms that have 


Figure 
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Figure 

23.10 


current levels of carbon emissions denoted by (xj , 2 ) 2 ). Firm i can reduce 
its level of emissions by Xi at a cost of Ci(xi). Figure 23.10 shows a possible 
shape for this cost function. 

The goal is to reduce emissions by some target amount, T, in the least 
costly way. This minimization problem can be written as 

min Ci(xi) + c 2 (x 2 ) 

Xl,X2 

such that x\ + x 2 = T. 

If it knew the cost functions, the government could, in principle, solve this 
optimization problem and assign a specific amount of emission reductions 
to each firm. However, this is impractical if there are thousands of carbon 
emitters. The challenge is to find a decentralized, market-based way of 
achieving the optimal solution. 

Let us examine the structure of the optimization problem. It is clear 
that at the optimal solution the marginal cost of reducing emissions must 
be the same for each firm. Otherwise it would pay to increase emissions in 
the firm with the lower marginal cost and decrease emissions in the firm 
with the higher marginal cost. This would keep the total output at the 
target level while reducing costs. 

Hence we have a simple principle: at the optimal solution, the marginal 
cost of emissions reduction should be the same for every firm. In the two- 
firm case we are examining, we can find this optimal point using a simple 
diagram. Let MC\{x\) be the marginal cost of reducing emissions by x\ 



Cost function for emissions. The curve shows the cost 
associated with emission reductions. 






CARBON TAX VERSUS CAP AND TRADE 435 


for firm 1 and write the marginal cost of emission-reduction for firm 2 as 
a function of firm l’s output: — xi), assuming the target is met. 

We plot these two curves in Figure 23.11. The point where they intersect 
determines the optimal division of emission reductions between the two 
firms given that T emission reductions are to be produced in total. 



gives the optimal carbon tax and emissions license price. 


Figure 

23.11 


A Carbon Tax 

Instead of solving for the cost-minimizing solution directly, let us instead 
consider a decentralized solution using a carbon tax. In this framework, 
the government sets a tax rate t that it charges for carbon emissions. 

If firm 1 starts with X\ and reduces its emissions by xi, then it ends up 
with xi — X\ emissions. If it pays t per unit emitted, its carbon tax bill 
would be t(xi — Xi). 

Faced with this tax, firm 1 would want to choose that level of emission 
reductions that minimized its total cost of operation: the cost of reducing 
emissions plus the cost of paying the carbon tax on the emissions that 
remain. This leads to the cost minimization problem 

min ci(xi) + t(x i — Xi). 

Xi 

Clearly the firm will want to reduce emissions up to the point where the 
marginal cost of further reductions just equals the carbon tax, i.e., where 
t = MCi(xi). 
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If the carbon tax is set to be the rate t*, as determined in Figure 23.11, 
then the total amount of carbon emissions will be the targeted amount, 
T. Thus the carbon tax give a decentralized way to achieve the optimal 
outcome. 


Cap and Trade 

Suppose, alternatively that there is no carbon tax, but that the government 
issues tradable emissions licenses. Each license allows the firm that 
holds it to produce a certain amount of carbon emissions. The government 
chooses the number of emissions licenses to achieve the target reduction. 

We imagine a market in these licenses so each firm can buy a license 
to emit x units of carbon at a price of p per unit. The cost to firm 1 of 
reducing its emissions by x\ is ci(xi) + p(X\ — X\). Clearly the firm will 
want to operate where the price of an emissions license equals the marginal 
cost, p = MCi(xi). That is, it will choose the level of emissions at the 
point where the cost of reducing carbon emissions by one unit would just 
equal the cost saved by not having to purchase a license. 

Hence the marginal cost curve gives us the supply of emissions as a 
function of the price. The equilibrium price is the price where the total 
supply of emissions equals the target amount T. The associated price is 
the same as the optimal carbon tax rate t* in Figure 23.11. 

The question that remains is how to distribute the licenses. One way 
would be to have the government sell the licenses to firms. This is essen¬ 
tially the same as the carbon tax system. The government could pick a 
price and sell however many licenses are demanded at that price. Alter¬ 
natively, it could pick a target level of emissions and auction off permits, 
letting the firms themselves determine a price. This is one type of “cap and 
trade” system. Both of these policies should lead to essentially the same 
market-clearing price. 

Another possibility would be for the government to hand out the licenses 
to the firms according to some formula. This formula could be based on 
a variety of criteria, but presumably an important reason to award these 
valuable permits would be building political support for the program. Per¬ 
mits might be handed out based on objective criteria, such as which firms 
have the most employees, or they might be handed out based on which 
firms have donated the most to some political causes. 

From the economic point of view, it doesn’t matter whether the gov¬ 
ernment owns the licenses and sells them to the firms (which is basically 
a carbon tax system) or whether the firms are given the licenses and sell 
them to each other (which is basically cap and trade). 

If a cap and trade system is created, firms will find it attractive to invest 
in ways to acquire the emission permits. For example, they would want 
to lobby Congress for such licenses. These lobbying expenditures should 
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be counted as part of the cost of the system, as described in our earlier 
discussion of rent seeking. Of course, the carbon tax system would also be 
subject to similar lobbying. Firms would undoubtedly seek special carbon 
tax exemptions for one reason or another, but it has been argued that the 
carbon tax system is less susceptible to political manipulation than a cap 
and trade system. 


Summary 

1. The short-run supply curve of an industry is just the horizontal sum of 
the supply curves of the individual firms in that industry. 

2. The long-run supply curve of an industry must take into account the 
exit and entry of firms in the industry. 

3. If there is free entry and exit, then the long-run equilibrium will involve 
the maximum number of firms consistent with nonnegative profits. This 
means that the long-run supply curve will be essentially horizontal at a 
price equal to the minimum average cost. 

4. If there are forces preventing the entry of firms into a profitable industry, 
the factors that prevent entry will earn economic rents. The rent earned is 
determined by the price of the output of the industry. 


REVIEW QUESTIONS 


1. If Si(p) = p —10 and S 2 ip) = p— 15, then at what price does the industry 
supply curve have a kink in it? 

2. In the short run the demand for cigarettes is totally inelastic. In the long 
run, suppose that it is perfectly elastic. What is the impact of a cigarette 
tax on the price that consumers pay in the short run and in the long run? 

3. True or false? Convenience stores near the campus have high prices 
because they have to pay high rents. 

4. True or false? In long-run industry equilibrium no firm will be losing 
money. 

5. According to the model presented in this chapter, what determines the 
amount of entry or exit a given industry experiences? 

6. The model of entry presented in this chapter implies that the more firms 
in a given industry, the (steeper, flatter) is the long-run industry supply 


curve. 
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7. A New York City cab operator appears to be making positive profits in 
the long run after carefully accounting for the operating and labor costs. 
Does this violate the competitive model? Why or why not? 


CHAPTER 



MONOPOLY 


In the preceding chapters we have analyzed the behavior of a competitive 
industry, a market structure that is most likely when there are a large 
number of small firms. In this chapter we turn to the opposite extreme 
and consider an industry structure when there is only one firm in the 
industry—a monopoly. 

When there is only one firm in a market, that firm is very unlikely to 
take the market price as given. Instead, a monopoly would recognize its 
influence over the market price and choose that level of price and output 
that maximized its overall profits. 

Of course, it can’t choose price and output independently; for any given 
price, the monopoly will be able to sell only what the market will bear. If 
it chooses a high price, it will be able to sell only a small quantity. The 
demand behavior of the consumers will constrain the monopolist’s choice 
of price and quantity. 

We can view the monopolist as choosing the price and letting the con¬ 
sumers choose how much they wish to buy at that price, or we can think of 
the monopolist as choosing the quantity, and letting the consumers decide 
what price they will pay for that quantity. The first approach is probably 
more natural, but the second turns out to be analytically more convenient. 
Of course, both approaches are equivalent when done correctly. 
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24.1 Maximizing Profits 

We begin by studying the monopolist’s profit-maximization problem. Let 
us use p(y) to denote the market inverse demand curve and c(y) to denote 
the cost function. Let r(y) = p{y)y denote the revenue function of the 
monopolist. The monopolist’s profit-maximization problem then takes the 
form 

max r(y) — c(y). 
v 

The optimality condition for this problem is straightforward: at the op¬ 
timal choice of output we must have marginal revenue equal to marginal 
cost. If marginal revenue were less than marginal cost it would pay the firm 
to decrease output, since the savings in cost would more than make up for 
the loss in revenue. If the marginal revenue were greater than the marginal 
cost, it would pay the firm to increase output. The only point where the 
firm has no incentive to change output is where marginal revenue equals 
marginal cost. 

In terms of algebra, we can write the optimization condition as 

MR = MC 


or 

A r Ac 
Ay Ay ' 

The same MR = MC condition has to hold in the case of a competitive 
firm; in that case, marginal revenue is equal to the price and the condition 
reduces to price equals marginal cost. 

In the case of a monopolist, the marginal revenue term is slightly more 
complicated. If the monopolist decides to increase its output by Ay, there 
are two effects on revenues. First it sells more output and receives a revenue 
of pAy from that. But second, the monopolist pushes the price down by 
A p and it gets this lower price on all the output it has been selling. 

Thus the total effect on revenues of changing output by Ay will be 

A?’ = pAy + yAp, 

so that the change in revenue divided by the change in output— -the mar¬ 
ginal revenue—is 

Ar A p 

(This is exactly the same derivation we went through in our discussion of 
marginal revenue in Chapter 15. You might want to review that material 
before proceeding.) 
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Another way to think about this is to think of the monopolist as choosing 
its output and price simultaneously—recognizing, of course, the constraint 
imposed by the demand curve. If the monopolist wants to sell more output 
it has to lower its price. But this lower price will mean a lower price for all 
of the units it is selling, not just the new units. Hence the term yAp. 

In the competitive case, a firm that could lower its price below the price 
charged by other firms would immediately capture the entire market from 
its competitors. But in the monopolistic case, the monopoly already has 
the entire market; when it lowers its price, it has to take into account the 
effect of the price reduction on all the units it sells. 

Following the discussion in Chapter 15, we can also express marginal 
revenue in terms of elasticity via the formula 


MR(y) = p(y ) 



and write the “marginal revenue equals marginal costs” optimality condi¬ 
tion as 


p(y) 


1 + 


<y)_ 


= MC(y). 


(24.1) 


Since elasticity is naturally negative, we could also write this expression 


as 


p(y) 



MC(y). 


From these equations it is easy to see the connection with the competitive 
case: in the competitive case, the firm faces a flat demand curve—an in¬ 
finitely elastic demand curve. This means that l/|e| = l/oo = 0, so the 
appropriate version of this equation for a competitive firm is simply price 
equals marginal cost. 

Note that a monopolist will never choose to operate where the demand 
curve is inelastic. For if |e| < 1, then l/|e| > 1, and the marginal revenue 
is negative, so it can’t possibly equal marginal cost. The meaning of this 
becomes clear when we think of what is implied by an inelastic demand 
curve: if |e| < 1, then reducing output will increase revenues, and reducing 
output must reduce total cost, so profits will necessarily increase. Thus any 
point where |e| < 1 cannot be a profit maximum for a monopolist, since it 
could increase its profits by producing less output. It follows that a point 
that yields maximum profits can only occur where |e| > 1. 


24.2 Linear Demand Curve and Monopoly 

Suppose that the monopolist faces a linear demand curve 


p(y) = a — by. 
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Then the revenue function is 

r(y ) =p{y)y = ay-by 2 , 
and the marginal revenue function is 

MR(y) = a — 2 by. 

(This follows from the formula given at the end of Chapter 15. It is easy 
to derive using simple calculus. If you don’t know calculus, just memorize 
the formula, since we will use it quite a bit.) 

Note that the marginal revenue function has the same vertical intercept, 
a, as the demand curve, but it is twice as steep. This gives us an easy way 
to draw the marginal revenue curve. We know that the vertical intercept is 
a. To get the horizontal intercept, just take half of the horizontal intercept 
of the demand curve. Then connect the two intercepts with a straight line. 
We have illustrated the demand curve and the marginal revenue curve in 
Figure 24.1. 



Monopoly with a linear demand curve. The monopolist’s 
profit-maximizing output occurs where marginal revenue equals 
marginal cost. 


Figure 

24.1 
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The optimal output, y*, is where the marginal revenue curve intersects 
the marginal cost curve. The monopolist will then charge the maximum 
price it can get at this output, p{y*). This gives the monopolist a revenue 
oip(y*)y* from which we subtract the total cost c(y*) = AC(y*)y*, leaving 
a profit area as illustrated. 


24.3 Markup Pricing 


We can use the elasticity formula for the monopolist to express its optimal 
pricing policy in another way. Rearranging equation (24.1) we have 


p(y) 


MC(y*) 

i-i/Ky)!' 


(24.2) 


This formulation indicates that the market price is a markup over marginal 
cost, where the amount of the markup depends on the elasticity of demand. 
The markup is given by 

1 

i-i /\<y)Y 

Since the monopolist always operates where the demand curve is elastic, 
we are assured that |e| > 1, and thus the markup is greater than 1. 

In the case of a constant-elasticity demand curve, this formula is espe¬ 
cially simple since e(y) is a constant. A monopolist who faces a constant- 
elasticity demand curve will charge a price that is a constant markup 
on marginal cost. This is illustrated in Figure 24.2. The curve labeled 
MC /(1 — l/|e|) is a constant fraction higher than the marginal cost curve; 
the optimal level of output occurs where p = MC/(1 — l/|e|). 


EXAMPLE: The Impact of Taxes on a Monopolist 

Let us consider a firm with constant marginal costs and ask what happens 
to the price charged when a quantity tax is imposed. Clearly the marginal 
costs go up by the amount of the tax, but what happens to the market 
price? 

Let’s first consider the case of a linear demand curve, as depicted in 
Figure 24.3. When the marginal cost curve, MC, shifts up by the amount 
of the tax to MC+t , the intersection of marginal revenue and marginal cost 
moves to the left. Since the demand curve is half as steep as the marginal 
revenue curve, the price goes up by half the amount of the tax. 

This is easy to see algebraically. The marginal revenue equals marginal 
cost plus the tax condition is 


a — 2 by = c + t. 
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Figure 

24.2 



Monopoly with constant elasticity demand. To locate the 
profit-maximizing output level we find the output level where 
the curve MC/{ 1 — l/|e|) crosses the demand curve. 


Solving for y yields 


V = 


a — c — t 
2b 


Thus the change in output is given by 


A y = 

At ~ 2b' 


The demand curve is 

P(y) = a-by, 


so price will change by — b times the change in output: 


A p 
At 



1 

2 ' 


In this calculation the factor 1/2 occurs because of the assumptions of 
the linear demand curve and constant marginal costs. Together these as¬ 
sumptions imply that the price rises by less than the tax increase. Is this 
likely to be true in general? 

The answer is no—in general a tax may increase the price by more or 
less than the amount of the tax. For an easy example, consider the case of 
a monopolist facing a constant-elasticity demand curve. Then we have 


c+t 
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OUTPUT 


Linear demand and taxation. Imposition of a tax on a 
monopolist facing a linear demand. Note that the price will rise 
by half the amount of the tax. 


Figure 

24.3 


so that 

A p 1 

A t = 1 — l/|e| ’ 

which is certainly bigger than 1. In this case, the monopolist passes on 
more than the amount of the tax. 

Another kind of tax that we might consider is the case of a profits tax. 
In this case the monopolist is required to pay some fraction r of its profits 
to the government. The maximization problem that it faces is then 

max (1 - r)\p{y)y - c(y)\. 
v 

But the value of y that maximizes profits will also maximize (1 — r) times 
profits. Thus a pure profits tax will have no effect on a monopolist’s choice 
of output. 


24.4 Inefficiency of Monopoly 

A competitive industry operates at a point where price equals marginal 
cost. A monopolized industry operates where price is greater than mar¬ 
ginal cost. Thus in general the price will be higher and the output lower 












446 MONOPOLY (Ch. 24) 


if a firm behaves monopolistically rather than competitively. For this rea¬ 
son, consumers will typically be worse off in an industry organized as a 
monopoly than in one organized competitively. 

But, by the same token, the firm will be better off! Counting both the 
firm and the consumer, it is not clear whether competition or monopoly 
will be a “better” arrangement. It appears that one must make a value 
judgment about the relative welfare of consumers and the owners of firms. 
However, we will see that one can argue against monopoly on grounds of 
efficiency alone. 

Consider a monopoly situation, as depicted in Figure 24.4. Suppose that 
we could somehow costlessly force this firm to behave as a competitor and 
take the market price as being set exogenously. Then we would have ( p c , y c ) 
for the competitive price and output. Alternatively, if the firm recognized 
its influence on the market price and chose its level of output so as to 
maximize profits, we would see the monopoly price and output (p m ,j/ m ). 


Figure 

24.4 



Inefficiency of monopoly. A monopolist produces less than 
the competitive amount of output and is therefore Pareto inef¬ 
ficient. 


Recall that an economic arrangement is Pareto efficient if there is no way 
to make anyone better off without making somebody else worse off. Is the 
monopoly level of output Pareto efficient? 
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Remember the definition of the inverse demand curve. At each level of 
output, p{y) measures how much people are willing to pay for an additional 
unit of the good. Since p(y) is greater than MC(y) for all the output levels 
between y m and y c , there is a whole range of output where people are 
willing to pay more for a unit of output than it costs to produce it. Clearly 
there is a potential for Pareto improvement here! 

For example, consider the situation at the monopoly level of output y m . 
Since p(y m ) > MC(y m ) we know that there is someone who is willing to 
pay more for an extra unit of output than it costs to produce that extra 
unit. Suppose that the firm produces this extra output and sells it to this 
person at any price p where p(y m ) > p > MC(y m ). Then this consumer 
is made better off because he or she was just willing to pay p(y m ) for that 
unit of consumption, and it was sold for p < p(y m ). Similarly, it cost the 
monopolist MC(y m ) to produce that extra unit of output and it sold it for 
p > MC{y m ). All the other units of output are being sold for the same 
price as before, so nothing has changed there. But in the sale of the extra 
unit of output, each side of the market gets some extra surplus—each side 
of the market is made better off and no one else is made worse off. We 
have found a Pareto improvement. 

It is worthwhile considering the reason for this inefficiency. The efficient 
level of output is when the willingness to pay for an extra unit of output 
just equals the cost of producing this extra unit. A competitive firm makes 
this comparison. But a monopolist also looks at the effect of increasing 
output on the revenue received from the inframarginal units, and these 
inframarginal units have nothing to do with efficiency. A monopolist would 
always be ready to sell an additional unit at a lower price than it is currently 
charging if it did not have to lower the price of all the other inframarginal 
units that it is currently selling. 


24.5 Deadweight Loss of Monopoly 

Now that we know that a monopoly is inefficient, we might want to know 
just how inefficient it is. Is there a way to measure the total loss in efficiency 
due to a monopoly? We know how to measure the loss to the consumers 
from having to pay p m rather than p c —we just look at the change in 
consumers’ surplus. Similarly, for the firm we know how to measure the 
gain in profits from charging p m rather than p c —we just use the change in 
producer’s surplus. 

The most natural way to combine these two numbers is to treat the 
firm—or, more properly, the owners of the firm—and the consumers of 
the firm’s output symmetrically and add together the profits of the firm 
and the consumers’ surplus. The change in the profits of the firm -the 
change in producer’s surplus—measures how much the owners would be 
willing to pay to get the higher price under monopoly, and the change in 
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consumers’ surplus measures how much the consumers would have to be 
paid to compensate them for the higher price. Thus the difference between 
these two numbers should give a sensible measure of the net benefit or cost 
of the monopoly. 

The changes in the producer’s and consumers’ surplus from a movement 
from monopolistic to competitive output are illustrated in Figure 24.5. The 
monopolist’s surplus goes down by A due to the lower price on the units he 
was already selling. It goes up by C due to the profits on the extra units 
it is now selling. 


Figure 

24.5 



Deadweight loss of monopoly. The deadweight loss due to 
the monopoly is given by the area B + C. 


The consumers’ surplus goes up by A, since the consumers are now get¬ 
ting all the units they were buying before at a cheaper price; and it goes 
up by B , since they get some surplus on the extra units that are being 
sold. The area A is just a transfer from the monopolist to the consumer; 
one side of the market is made better off and one side is made worse off, 
but the total surplus doesn’t change. The area B + C represents a true 
increase in surplus—this area measures the value that the consumers and 
the producers place on the extra output that has been produced. 

The area B + C is known as the deadweight loss due to the monopoly. 
It provides a measure of how much worse off people are paying the mon- 
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opoly price than paying the competitive price. The deadweight loss due to 
monopoly, like the deadweight loss due to a tax, measures the value of the 
lost output by valuing each unit of lost output at the price that people are 
willing to pay for that unit. 

To see that the deadweight loss measures the value of the lost output, 
think about starting at the monopoly point and providing one additinal 
unit of output. The value of that marginal unit of output is the market 
price. The cost of producing the additional unit of output is the marginal 
cost. Thus the “social value” of producing an extra unit will be simply 
the price minus the marginal cost. Now consider the value of the next 
unit of output; again its social value will be the gap between price and 
marginal cost at that level of output. And so it goes. As we move from 
the monopoly level of output to the competitive level of output, we “sum 
up” the distances between the demand curve and the marginal cost curve 
to generate the value of the lost output due to the monopoly behavior. 
The total area between the two curves from the monopoly output to the 
competitive output is the deadweight loss. 


EXAMPLE: The Optimal Life of a Patent 

A patent offers inventors the exclusive right to benefit from their inven¬ 
tions for a limited period of time. Thus a patent offers a kind of limited 
monopoly. The reason for offering such patent protection is to encourage 
innovation. In the absence of a patent system, it is likely that individuals 
and firms would be unwilling to invest much in research and development, 
since any new discoveries that they would make could be copied by com¬ 
petitors. 

In the United States the life of a patent is 17 years. During that period, 
the holders of the patent have a monopoly on the invention; after the 
patent expires, anyone is free to utilize the technology described in the 
patent. The longer the life of a patent, the more gains can be accrued by 
the inventors, and thus the more incentive they have to invest in research 
and development. However, the longer the monopoly is allowed to exist, the 
more deadweight loss will be generated. The benefit from a long patent life 
is that it encourages innovation; the cost is that it encourages monopoly. 
The “optimal” patent life is the period that balances these two conflicting 
effects. 

The problem of determining the optimal patent life has been examined 
by William Nordhaus of Yale University. 1 As Nordhaus indicates, the prob¬ 
lem is very complex and there are many unknown relationships involved. 
Nevertheless, some simple calculations can give some insight as to whether 

1 William Nordhaus, Invention, Growth, and Welfare (Cambridge, Mass.: M.I.T. Press, 

1969). 
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the current patent life is wildly out of line with the estimated benefits and 
costs described above. 

Nordhaus found that for “run-of-the-mill” inventions, a patent life of 17 
years was roughly 90 percent efficient—meaning that it achieved 90 percent 
of the maximum possible consumers’ surplus. On the basis of these figures, 
it does not seem like there is a compelling reason to make drastic changes 
in the patent system. 


EXAMPLE: Patent Thickets 

The intellectual property protection offered by patents provides incentives 
to innovate, but this right can be abused. Some observers have argued 
that the extensions of intellectual property rights to business processes, 
software, and other domains has resulted in lower patent quality. 

One might think of patents as having three dimensions: length, width, 
and height. The “length” is the time that the patent protection applies. 
The “width” is how broadly the claims in the patent are interpreted. The 
“height” is the standard of novelty applied in determining whether the 
patent really represents a new idea. Unfortunately, only the length is easily 
quantified. The other aspects of patent quality, breadth, and novelty, can 
be quite subjective. 

Since it has become so easy to acquire patents in recent years, many firms 
have invested in acquiring patent portfolios on nearly every aspect of their 
business. Any company that wants to enter a business and compete with an 
incumbent who owns a broad range of patents may find itself encumbered 

in a patent thicket. 

Even firms that are already well established find it important to invest 
in acquiring a patent portfolio. In 2004, Microsoft paid $440 million to 
InterTrust Technology to license a portfolio of patents related to computer 
security, and signed a 10-year pact with Sun Microsystems in which it 
paid $900 million to resolve patent issues. During 2003-04, Microsoft was 
granted over 1,000 patents. 

Why the emphasis on patent portfolios? For large companies like Mi¬ 
crosoft, their primary value is to be used as bargaining chips in cross-license 
agreements. 

The patent thickets that each company sets up operate like the nuclear 
missiles held by the U.S. and USSR during the Cold War. Each had enough 
missiles pointed at the other to create “mutually assured destruction” in 
the case that one side attacked. Hence, neither side could risk an attack. 

It’s the same issue with patent thickets. If IBM tries to sue HP for 
patent infringement, HP would pull out a collection of its own patents and 
countersue IBM for infringement in some other technology. Even companies 
that don’t particularly want to patent aspects of their business are forced 
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to do so in order to acquire the ammunition necessary for defense against 
other suits. 

The “nuclear bomb” option in patent thickets is a “preliminary injunc¬ 
tion.” In certain circumstances, a judge might compel a company to stop 
selling an item that may be infringing on someone else’s patent. This can 
be exceedingly costly. In 1986, Kodak had to completely shut down its in¬ 
stant photography business due to a court-ordered injunction. Eventually 
Kodak had to pay a billion-dollar judgment for patent infringement. 

An injunction to stop production can be a huge threat, but it has no force 
against companies that don’t produce anything. InterTrust, for example, 
didn’t sell any products—all of its income came from licensing patents. 
Hence, it could threaten to sue other companies for patent infringement 
without much worry about the threat of countersuits. 


EXAMPLE: Managing the Supply of Potatoes 

Everyone is familiar with the Organization of Petroleum Exporting Coun¬ 
tries (OPEC), the international oil cartel that attempts to influence the 
price of oil by setting production quota. Normally, coordinating produc¬ 
tion to push up prices is illegal in the United States, but there are some 
industries that are exempt from antitrust rules. 

A notable example is agricultural producers. The 1922 Capper-Volstead 
Act specifically exempts farmers from federal antitrust rules. The result 
has been the creation of a number of “agricultural marketing boards” that 
attempt to voluntarily regulate the supply of agricultural products. 

For example, the United Potato Growers of America, formed in March 
2005, has signed up potato farmers that represent over 60% of the potato 
acreage in the United States. In 2005 it claimed to reduce production of 
potatoes by 6.8 million sacks of potatoes, each weighing about 100 pounds 
a piece. According to the Wall Street Journal this is equivalent to about 
1.3 billion orders of french fries. 2 


24.6 Natural Monopoly 

We have seen earlier that the Pareto efficient amount of output in an indus¬ 
try occurs where price equals marginal cost. A monopolist produces where 
marginal revenue equals marginal cost and thus produces too little output. 
It would seem that regulating a monopoly to eliminate the inefficiency is 
pretty easy—all the regulator has to do is to set price equal to marginal 


2 


Timothy W. Martin, “This Spud’s Not for You,” Wall Street Journal, September 26, 
2009. 
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Figure 

24.6 


cost, and profit maximization will do the rest. Unfortunately, this anal¬ 
ysis leaves out one important aspect of the problem: it may be that the 
monopolist would make negative profits at such a price. 

An example of this is shown in Figure 24.6. Here the minimum point 
of the average cost curve is to the right of the demand curve, and the 
intersection of demand and marginal cost lies underneath the average cost 
curve. Even though the level of output dmc is efficient, it is not profitable. 
If a regulator set this level of output, the monopolist would prefer to go 
out of business. 



A natural monopoly. If a natural monopolist operates where 
price equals marginal cost, then it will produce an efficient level 
of output, umc > but it will be unable to cover its costs. If it 
is required to produce an output where price equals average 
cost, ?MC) then it will cover its costs, but will produce too little 
output relative to the efficient amount. 


This kind of situation often arises with public utilities. Think of a gas 
company, for example. Here the technology involves very large fixed costs— 
creating and maintaining the gas delivery pipes—and a very small marginal 
cost to providing extra units of gas—once the pipe is laid, it costs very lit¬ 
tle to pump more gas down the pipe. Similarly, a local telephone company 
involves very large fixed costs for providing the wires and switching net¬ 
work, while the marginal costs of an extra unit of telephone service is very 
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low. When there are large fixed costs and small marginal costs, you can 
easily get the kind of situation described in Figure 24.6. Such a situation 
is referred to as a natural monopoly. 

If allowing a natural monopolist to set the monopoly price is undesir¬ 
able due to the Pareto inefficiency, and forcing the natural monopoly to 
produce at the competitive price is infeasible due to negative profits, what 
is left? For the most part natural monopolies are regulated or operated 
by governments. Different countries have adopted different approaches. In 
some countries the telephone service is provided by the government and in 
others it is provided by private firms that are regulated by the government. 
Both of these approaches have their advantages and disadvantages. 

For example, let us consider the case of government regulation of a nat¬ 
ural monopoly. If the regulated firm is to require no subsidy, it must make 
nonnegative profits, which means it must operate on or above the average 
cost curve. If it is to provide service to all who are willing to pay for it, 
it must also operate on the demand curve. Thus the natural operating 
position for a regulated firm is a point like (paCi Vac) i n Figure 24.6. Here 
the firm is selling its product at the average cost of production, so it covers 
its costs, but it is producing too little output relative to the efficient level 
of output. 

This solution is often adopted as a reasonable pricing policy for a natural 
monopolist. Government regulators set the prices that the public utility 
is allowed to charge. Ideally these prices are supposed to be prices that 
just allow the firm to break even—produce at a point where price equals 
average costs. 

The problem facing the regulators is to determine just what the true 
costs of the firm are. Usually there is a public utility commission that 
investigates the costs of the monopoly in an attempt to determine the true 
average cost and then sets a price that will cover costs. (Of course, one of 
these costs is the payment that the firm has to make to its shareholders 
and other creditors in exchange for the money they have loaned to the 
firm.) 

In the United States these regulatory boards operate at the state and 
local level. Typically electricity, natural gas, and telephone service operate 
in this way. Other natural monopolies like cable TV are usually regulated 
at the local level. 

The other solution to the problem of natural monopoly is to let the gov¬ 
ernment operate it. The ideal solution here in this case is to operate the 
service at price equals marginal cost and provide a lump-sum subsidy to 
keep the firm in operation. This is often the practice for local public trans¬ 
portation systems such as buses and subways. The lump-sum subsidies 
may not reflect inefficient operation per se but rather, simply reflect the 
large fixed costs associated with such public utilities. 

Then again, the subsidies may just represent inefficiency! The problem 
with government-run monopolies is that it is almost as difficult to mea- 
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Figure 

24.7 


sure their costs as it is to measure the costs of regulated public utilities. 
Government regulatory commissions that oversee the operations of public 
utilities often subject them to probing hearings to require them to justify 
cost data whereas an internal government bureaucracy may escape such 
intense scrutiny. The government bureaucrats who run such government 
monopolies may turn out to be less accountable to the public than those 
who run the regulated monopolies. 


24.7 What Causes Monopolies? 


Given information on costs and demand, when would we predict that an 
industry would be competitive and when would we predict that it would be 
monopolized? In general the answer depends on the relationship between 
the average cost curve and the demand curve. The crucial factor is the 
size of the minimum efficient scale (MES), the level of output that 
minimizes average cost, relative to the size of demand. 

Consider Figure 24.7 where we have illustrated the average cost curves 
and the market demand curves for two goods. In the first case there is room 
in the market for many firms, each charging a price close to p* and each 
operating at a relatively small scale. In the second market, only one firm 
can make positive profits. We would expect that the first market might 
well operate as a competitive market and that the second would operate 
as a monopolist. 
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mand is large relative to the minimum efficient scale, a compet¬ 
itive market is likely to result. (B) If it is small, a monopolistic 
industry structure is possible. 
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Thus the shape of the average cost curve, which in turn is determined by 
the underlying technology, is one important aspect that determines whether 
a market will operate competitively or monopolistically. If the minimum 
efficient scale of production—the level of output that minimizes average 
costs—is small relative to the size of the market, we might expect that 
competitive conditions will prevail. 

Note that this is a relative statement: what matters is the scale relative to 
the market size. We can’t do too much about the minimum efficient scale— 
that is determined by the technology. But economic policy can influence 
the size of the market. If a country chooses nonrestrictive foreign-trade 
policies, so that domestic firms face foreign competition, then the domestic 
firms’ ability to influence prices will be much less. Conversely, if a country 
adopts restrictive trade policies, so that the size of the market is limited 
only to that country, then monopolistic practices are more likely to take 
hold. 

If monopolies arise because the minimum efficient scale is large relative 
to the size of the market, and it is infeasible to increase the size of the 
market, then the industry is a candidate for regulation or other sorts of 
government intervention. Of course such regulation and intervention are 
costly too. Regulatory boards cost money, and the efforts of the firm 
to satisfy the regulatory boards can be quite expensive. From society’s 
point of view, the question should be whether the deadweight loss of the 
monopoly exceeds the costs of regulation. 

A second reason why monopoly might occur is that several different firms 
in an industry might be able to collude and restrict output in order to raise 
prices and thereby increase their profits. When firms collude in this way 
and attempt to reduce output and increase price, we say the industry is 
organized as a cartel. 

Cartels are illegal. The Antitrust Division of the Justice Department and 
the Bureau of Competition of the Federal Trade Commission are charged 
with searching for evidence of noncompetitive behavior on the part of firms. 
If the government can establish that a group of firms attempted to restrict 
output or engaged in certain other anticompetitive practices, the firms in 
question can be forced to pay heavy fines. 

On the other hand, an industry may have one dominant firm purely 
by historical accident. If one firm is first to enter some market, it may 
have enough of a cost advantage to be able to discourage other firms from 
entering the industry. Suppose, for example, that there are very large 
“tooling-up” costs to entering an industry. Then the incumbent—the firm 
already in the industry—may under certain conditions be able to convince 
potential entrants that it will cut its prices drastically if they attempt 
to enter the industry. By preventing entry in this manner, a firm can 
eventually dominate a market. We will study an example of pricing to 
prevent entry in Chapter 28. 
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EXAMPLE: Diamonds Are Forever 

The De Beers diamond cartel was formed by Sir Ernest Oppenheimer, a 
South African mine operator, in 1930. It has since grown into one of the 
world’s most successful cartels. De Beers handles over 80% of the world’s 
yearly production of diamonds and has managed to maintain this near¬ 
monopoly for several decades. Over the years, De Beers has developed 
several mechanism to maintain control of the diamond market. 

First, it maintains considerable stocks of diamonds of all types. If a 
producer attempts to sell outside the cartel, De Beers can quickly flood the 
market with the same type of diamond, thereby punishing the defector from 
the cartel. Second, large producers’ quotas are based on the proportion 
of total sales. When the market is weak, everyone’s production quota 
is reduced proportionally, thereby automatically increasing scarcity and 
raising prices. 

Third, De Beers is involved at both the mining and wholesaling levels of 
diamond production. In the wholesale market diamonds are sold to cutters 
in boxes of assorted diamonds: buyers take a whole box or nothing—they 
cannot choose individual stones. If the market is weak for a certain size 
of diamond, De Beers can reduce the number of those diamonds offered in 
the boxes, thereby making them more scarce. 

Finally, De Beers can influence the direction of final demand for diamonds 
by the $110 million a year it spends on advertising. Again, this advertising 
can be adjusted to encourage demand for the types and sizes of diamonds 
that are in relatively scarce supply. 3 


EXAMPLE: Pooling in Auction Markets 

Adam Smith once said “People of the same trade seldom meet together, 
even for merriment and diversion, but the conversation ends in a conspir¬ 
acy against the public, or in some contrivance to raise prices.” Bidding 
pools in auctions provide an illustrative example of Smith’s observation. 
In 1988 the Justice Department charged 12 Philadelphia antique dealers 
with antitrust violations for their participation in this particular kind of 
“conspiracy against the public.” 4 


3 A short description of the diamond market can be found in “The cartel lives to 
face another threat,” The Economist, January 10, 1987, 58-60. A more detailed 
description can be found in Edward J. Epstein, Cartel (New York: Putnam, 1978). 

4 See Meg Cox, “At Many Auctions, Illegal Bidding Thrives As a Longtime Practice 
Among Dealers,” Wall Street Journal, February 19, 1988, which served as the source 
for this example. 


WHAT CAUSES MONOPOLIES? 457 


The dealers were accused of participating in “bidding rings,” or “pools,” 
at antique furniture auctions. The members of a pool would appoint one 
member to bid on certain items. If this bidder succeeded in acquiring 
an item, the participating dealers would then hold a subsequent private 
auction, called a “knockout,” in which the members of the pool bid among 
themselves for the item. This practice allowed the members of the pool to 
acquire the items at much lower prices than would have prevailed if they 
had bid separately; in many cases the prices in the knockout auctions were 
50 to 100 percent greater than the prices paid to the original sellers of the 
goods. 

The dealers were surprised by the Justice Department suit; they consid¬ 
ered pooling a common business practice in their trade and did not think it 
was illegal. They thought of the pools as a tradition of cooperation among 
themselves; being invited to join a pool was considered a “mark of dis¬ 
tinction.” According to one dealer, “The day I was allowed to go into the 
pool was a banner day. If you weren’t in the pool, you weren’t considered 
much of a dealer.” The dealers were so naive that they kept careful records 
of their payments in the knockout auctions, which were later used by the 
Justice Department in the suits against the dealers. 

The Justice Department argued “if they are joining together to hold down 
the price [received by the seller] that is illegal.” The Justice Department 
view prevailed over that of the dealers: 11 of the 12 dealers pleaded guilty 
and settled the matter with fines of $1,000 to $50,000 and probation. The 
dealer who held out for a jury trial was found guilty and sentenced to 30 
days of house arrest and a fine of $30,000. 


EXAMPLE: Price Fixing in Computer Memory Markets 

DRAM chips are the “dynamic random access memory” chips that go in 
your computer. They are pretty much an undifferentiated commodity prod¬ 
uct and the market for DRAMs is (usually) highly competitive. However, 
there are allegations that several DRAM producers conspired to fix prices 
and charge computer makers a higher price than would have obtained under 
purely competitive conditions. Apple Computer, Compaq, Dell, Gateway, 
HP, and IBM were apparently affected by this conspiracy. 

The Department of Justice started investigating these allegations in 2002. 
In September 2004, Infineon, a German DRAM manufacturer, pleaded 
guilty to charges of price fixing, and agreed to pay a $160 million fine. 
This was the third largest criminal fine ever imposed by the Department 
of Justice’s antitrust division. 

According to the court documents, Infineon was charged with “Partic¬ 
ipating in meetings, conversations, and communications with competitors 
to discuss the prices of DRAM to be sold to certain customers; Agreeing to 
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price levels of DRAM to be sold to certain customers; Exchanging informa¬ 
tion on sales of DRAM to certain customers, for the purpose of monitoring 
and enforcing the agreed-upon prices.” 

Subsequently, four executives at Infineon were sentenced to prison terms 
and had to pay hefty fines. The investigation is still underway, with other 
indictments expected to follow. The antitrust authorities take price fixing 
very seriously, and the consequences to companies and individuals that 
engage in such activities can be severe. 


Summary 

1. When there is only a single firm in an industry, we say that it is a 
monopoly. 

2. A monopolist operates at a point where marginal revenue equals mar¬ 
ginal cost. Hence a monopolist charges a price that is a markup on marginal 
cost, where the size of the markup depends on the elasticity of demand. 

3. Since a monopolist charges a price in excess of marginal cost, it will 
produce an inefficient amount of output. The size of the inefficiency can 
be measured by the deadweight loss—the net loss of consumers’ and the 
producer’s surplus. 

4. A natural monopoly occurs when a firm cannot operate at an efficient 
level of output without losing money. Many public utilities are natural 
monopolies of this sort and are therefore regulated by the government. 

5. Whether an industry is competitive or monopolized depends in part on 
the nature of technology. If the minimum efficient scale is large relative to 
demand, then the market is likely to be monopolized. But if the minimum 
efficient scale is small relative to demand, there is room for many firms in 
the industry, and there is a hope for a competitive market structure. 


REVIEW QUESTIONS 


1. The market demand curve for heroin is said to be highly inelastic. Heroin 
supply is also said to be monopolized by the Mafia, which we assume to be 
interested in maximizing profits. Are these two statements consistent? 

2. The monopolist faces a demand curve given by D(jp) = 100 — 2 p. Its cost 
function is c(y) = 2 y. What is its optimal level of output and price? 

3. The monopolist faces a demand curve given by D(p) = 10p~ 3 . Its cost 
function is c(y) = 2 y. What is its optimal level of output and price? 


APPENDIX 459 


4. If D{jp) = 100/p and c(y) = y 2 , what is the optimal level of output of 
the monopolist? (Be careful.) 

5. A monopolist is operating at an output level where |e| = 3. The govern¬ 
ment imposes a quantity tax of $6 per unit of output. If the demand curve 
facing the monopolist is linear, how much does the price rise? 

6. What is the answer to the above question if the demand curve facing the 
monopolist has constant elasticity? 

7. If the demand curve facing the monopolist has a constant elasticity of 2, 
then what will be the monopolist’s markup on marginal cost? 

8. The government is considering subsidizing the marginal costs of the 
monopolist described in the question above. What level of subsidy should 
the government choose if it wants the monopolist to produce the socially 
optimal amount of output? 

9. Show mathematically that a monopolist always sets its price above mar¬ 
ginal cost. 

10. True or false? Imposing a quantity tax on a monopolist will always 
cause the market price to increase by the amount of the tax. 

11. What problems face a regulatory agency attempting to force a monop¬ 
olist to charge the perfectly competitive price? 

12. What kinds of economic and technological conditions are conducive to 
the formation of monopolies? 


APPENDIX 

Define the revenue function by r(y) = p(y)y. Then the monopolist’s profit- 
maximization problem is 

max r{y) - c(y). 

The first-order condition for this problem is simply 

r'(y) ~ c(y) = 0, 

which implies that marginal revenue should equal marginal cost at the optimal 
choice of output. 

Differentiating the definition of the revenue function gives r'(y) = p(y)+p' (y)y, 
and substituting this into the monopolist’s first-order condition yields the alter¬ 
native form 


p(y)+p(y)y = c(y). 
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The second-order condition for the monopolist’s profit-maximization problem 
is 

r"(y) ~ c"(y) < 0. 


This implies that 


c"(y) > r"(y ) 

or that the slope of the marginal cost curve exceeds the slope of the marginal 
revenue curve. 


CHAPTER 



MONOPOLY 

BEHAVIOR 


In a competitive market there are typically several firms selling an identical 
product. Any attempt by one of the firms to sell its product at more than 
the market price leads consumers to desert the high-priced firm in favor of 
its competitors. In a monopolized market there is only one firm selling a 
given product. When a monopolist raises its price it loses some, but not 
all, of its customers. 

In reality most industries are somewhere in between these two extremes. 
If a gas station in a small town raises the price at which it sells gasoline 
and it loses most of its customers, it is reasonable to think that this firm 
must behave as a competitive firm. If a restaurant in the same town raises 
its price and loses only a few of its customers, then it is reasonable to think 
that this restaurant has some degree of monopoly power. 

If a firm has some degree of monopoly power it has more options open 
to it than a firm in a perfectly competitive industry. For example, it can 
use more complicated pricing and marketing strategies than a firm in a 
competitive industry. Or it can try to differentiate its product from the 
products sold by its competitors to enhance its market power even further. 
In this chapter we will examine how firms can enhance and exploit their 
market power. 
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25.1 Price Discrimination 

We have argued earlier that a monopoly operates at an inefficient level of 
output since it restricts output to a point where people are willing to pay 
more for extra output than it costs to produce it. The monopolist doesn’t 
want to produce this extra output, because it would force down the price 
that it would be able to get for all of its output. 

But if the monopolist could sell different units of output at different 
prices, then we have another story. Selling different units of output at 
different prices is called price discrimination. Economists generally con¬ 
sider the following three kinds of price discrimination: 

First-degree price discrimination means that the monopolist sells 
different units of output for different prices and these prices may differ 
from person to person. This is sometimes known as the case of perfect 
price discrimination. 

Second-degree price discrimination means that the monopolist sells 
different units of output for different prices, but every individual who buys 
the same amount of the good pays the same price. Thus prices differ across 
the units of the good, but not across people. The most common example 
of this is bulk discounts. 

Third-degree price discrimination occurs when the monopolist sells 
output to different people for different prices, but every unit of output sold 
to a given person sells for the same price. This is the most common form 
of price discrimination, and examples include senior citizens’ discounts, 
student discounts, and so on. 

Let us look at each of these to see what economics can say about how 
price discrimination works. 


25.2 First-Degree Price Discrimination 

Under first-degree price discrimination, or perfect price discrimi¬ 
nation, each unit of the good is sold to the individual who values it most 
highly, at the maximum price that this individual is willing to pay for it. 

Consider Figure 25.1, which illustrates two consumers’ demand curves 
for a good. Think of a reservation price model for demand where the indi¬ 
viduals choose integer amounts of the goods and each step in the demand 
curve represents a change in the willingness to pay for additional units of 
the good. We have also illustrated (constant) marginal cost curves for the 
good. 

A producer who is able to perfectly price discriminate will sell each unit 
of the good at the highest price it will command, that is, at each consumer’s 
reservation price. Since each unit is sold to each consumer at his or her 
reservation price for that unit, there is no consumers’ surplus generated in 
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WILLINGNESS 
TO PAY 

WILLINGNESS 
TO PAY 



A 

B 





QUANTITY 

QUANTITY 

First-degree price discrimination. Here are two consumers’ 
demand curves for a good along with the constant marginal cost 
curve. The producer sells each unit of the good at the maximum 
price it will command, which yields it the maximum possible 
profit. 


Figure 

25.1 


this market; all the surplus goes to the producer. In Figure 25.1 the colored 
areas indicate the producer’s surplus accruing to the monopolist. In an or¬ 
dinary competitive market setting these areas would represent consumers’ 
surplus, but in the case of perfect price discrimination, the monopolist is 
able to appropriate this surplus for itself. 

Since the producer gets all the surplus in the market, it wants to make 
sure that the surplus is as large as possible. Put another way, the producer’s 
goal is to maximize its profits (producer’s surplus) subject to the constraint 
that the consumers are just willing to purchase the good. This means 
that the outcome will be Pareto efficient, since there will be no way to 
make both the consumers and the producer better off: the producer’s profit 
can’t be increased, since it is already the maximal possible profit, and the 
consumers’ surplus can’t be increased without reducing the profit of the 
producer. 

If we move to the smooth demand curve approximation, as in Figure 25.2, 
we see that a perfectly price-discriminating monopolist must produce at an 
output level where price equals marginal cost: if price were greater than 
marginal cost, that would mean that there is someone who is willing to pay 
more than it costs to produce an extra unit of output. So why not produce 
that extra unit and sell it to that person at his or her reservation price, 
and thus increase profits? 

Just as in the case of a competitive market, the sum of producer’s and 
consumers’ surpluses is maximized. However, in the case of perfect price 
discrimination the producer ends up getting all the surplus generated in 
the market! 

We have interpreted first-degree price discrimination as selling each unit 
at the maximum price it will command. But we could also think of it as 
selling a fixed amount of the good at a “take it or leave it” price. In the 
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Figure 

25.2 


case illustrated in Figure 25.2, the monopolist would offer to sell x\ units of 
the good to person 1 at a price equal to the area under person l’s demand 
curve and offer to sell x% units of the good to person 2 at a price equal to 
the area under person 2’s demand curve B. As before, each person would 
end up with zero consumer’s surplus, and the entire surplus of A + B would 
end up in the hands of the monopolist. 

Perfect price discrimination is an idealized concept—as the word “per¬ 
fect” might suggest— but it is interesting theoretically since it gives us an 
example of a resource allocation mechanism other than a competitive mar¬ 
ket that achieves Pareto efficiency. There are very few real-life examples 
of perfect price discrimination. The closest example would be something 
like a small-town doctor who charges his patients different prices, based on 
their ability to pay. 

EXAMPLE: First-degree Price Discrimination in Practice 

As mentioned earlier, first-degree price discrimination is primarily a theo¬ 
retical concept. It’s hard to find real-world examples in which every indi¬ 
vidual is charged a different price. One possible example would be cases 
where prices are set by bargaining, as in automobile sales or in antique 
markets. However, these are not ideal examples. 

Southwest Airlines recently introduced a system called Ding that at¬ 
tempts something rather close to first-degree price discrimination. 1 The 

1 See Christopher Elliott, “Your Very Own Personal Air Fare,” New York Times, Au¬ 
gust 9, 2005. 



First-degree price discrimination with smooth demand 
curves. Here are two consumers’ smoothed demand curves 
for a good along with the constant marginal cost curve. Here 
the producer maximizes profits by producing where price equals 
marginal cost, just as in the case of a competitive market. 
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system uses the Internet in a clever way. The user installs a program on 
her computer and the airline sends special fare offers to the user period¬ 
ically. The fares are announced with a “ding” sound, hence the system 
name. According to one analyst, the fares offered by Ding were about 30 
percent lower than comparable fares. 

But will these low fares persist? One might also use such a system 
to offer higher fares. However, that possibility seems unlikely given the 
intensely competitive nature of the airline industry. It’s easy to switch 
back to standard ways of buying tickets if prices start creeping up. 


25.3 Second-Degree Price Discrimination 


Second-degree price discrimination is also known as the case of non¬ 
linear pricing, since it means that the price per unit of output is not 
constant but depends on how much you buy. This form of price discrimi¬ 
nation is commonly used by public utilities; for example, the price per unit 
of electricity often depends on how much is bought. In other industries 
bulk discounts for large purchases are sometimes available. 

Let us consider the case depicted earlier in Figure 25.2. We saw that the 
monopolist would like to sell an amount x ° to person 1 at price A+ cost 
and an amount 2° to person 2 at price B+ cost. To set the right prices, 
the monopolist has to know the demand curves of the consumers; that is, 
the monopolist has to know the exact willingness to pay of each person. 
Even if the monopolist knows something about the statistical distribution 
of willingness to pay—for example, that college students are willing to pay 
less than yuppies for movie tickets—it might be hard to tell a yuppie from 
a college student when they are standing in line at the ticket booth. 

Similarly, an airline ticket agent may know that business travelers are 
willing to pay more than tourists for their airplane tickets, but it is often 
difficult to tell whether a particular person is a business traveler or a tourist. 
If switching from a grey flannel suit to Bermuda shorts would save $500 on 
travel expenses, corporate dress codes could change quickly! 

The problem with the first-degree price discrimination example depicted 
in Figure 25.2 is that person 1—the high-willingess-to-pay person—can 
pretend to be person 2, the low-willingess-to-pay person. The seller may 
have no effective way to tell them apart. 

One way to get around this problem is to offer two different price-quantity 
packages in the market. One package will be targeted toward the high- 
demand person, the other package toward the low-demand person. It can 
often happen that the monopolist can construct price-quantity packages 
that will induce the consumers to choose the package meant for them; in 
economics jargon, the monopolist constructs price-quantity packages that 
give the consumers an incentive to self select. 


466 MONOPOLY BEHAVIOR (Ch. 25) 


In order to see how this works, Figure 25.3 illustrates the same kind of 
demand curves used in Figure 25.2, but now laid on top of each other. 
We’ve also set marginal cost equal to zero in this diagram to keep the 
argument simple. 


Figure 

25.3 
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Second-degree price discrimination. These are the de¬ 
mand curves of two consumers; the producer has zero marginal 
cost by assumption. Panel A illustrates the self-selection prob¬ 
lem. Panel B shows what happens if the monpolist reduces 
the output targeted for consumer 1, and panel C illustrates the 
profit-maximizing solution. 


As before, the monopolist would like to offer x ° at price A and to of¬ 
fer x 2 price A + B + C. This would capture all the surplus for the 
monopolist and generate the most possible profit. Unfortunately for the 
monopolist, these price-quantity combinations are not compatible with self¬ 
selection. The high-demand consumer would find it optimal to choose the 
quantity x® and pay price A; this would leave him with a surplus equal 

to area B , which is better than the zero surplus he would get if he chose 
~,o 

One thing the monopolist can do is to offer cc 2 at a price of A + C. In this 
case the high-demand consumer finds it optimal to choose cc 2 and receive 
a gross surplus of A + B + C. He pays the monopolist A + C , which yields 
a net surplus of B for consumer 2— just what he would get if he chose x 
This generally yields more profit to the monopolist than it would get by 
offering only one price-quantity combination. 

But the story doesn’t end here. There’s yet a further thing the mo¬ 
nopolist can do to increase profits. Suppose that instead of offering x\ at 
price A to the low-demand consumer, the monopolist offers a bit less than 
that at a price slightly less than A. This reduces the monopolist’s profits 
on person 1 by the small colored triangle illustrated in Figure 25.3B. But 
note that since person l’s package is now less attractive to person 2, the 
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monopolist can now charge more to person 2 for xty- By reducing x the 
monopolist makes area A a little smaller (by the dark triangle) but makes 
area C bigger (by the triangle plus the light trapezoid area). The net result 
is that the monopolist’s profits increase. 

Continuing in this way, the monopolist will want to reduce the amount 
offered to person 1 up to the point where the profit lost on person 1 due 
to a further reduction in output just equals the profit gained on person 2. 
At this point, illustrated in Figure 25.3C, the marginal benefits and costs 
of quantity reduction just balance. Person 1 chooses x™ and is charged A; 
person 2 chooses x° and is charged A + C + D. Person 1 ends up with a 
zero surplus and person 2 ends up with a surplus of B —just what he would 
get if he chose to consume x™. 

In practice, the monopolist often encourages this self-selection not by ad¬ 
justing the quantity of the good, as in this example, but rather by adjusting 
the quality of the good. The quantities in the model just examined can be 
re-interpreted as qualities, and everything works as before. In general, the 
monopolist will want to reduce the quality offered to the low end of its 
market so as not to cannibalize sales at the high end. Without the high- 
end consumers, the low-end consumers would be offered higher quality, but 
they would still end up with zero surplus. Without the low-end consumers, 
the high-end consumers would have zero surplus, so it is beneficial to the 
high-end consumers to have the low-end consumers present. This is be¬ 
cause the monopolist has to cut the price to the high-end consumers to 
discourage them from choosing the product targeted to the low-end con¬ 
sumers. 


EXAMPLE: Price Discrimination in Airfares 

The airline industry has been very successful at price discrimination (al¬ 
though industry representatives prefer to use the term “yield manage¬ 
ment.”) The model described above applies reasonably well to the problem 
faced by airlines: there are essentially two types of consumers, business 
travelers and individual travelers, who generally have quite different will¬ 
ingnesses to pay. Although there are several competing airlines in the U.S. 
market, it is quite common to see only one or two airlines serving specific 
city pairs. This gives the airlines considerable freedom in setting prices. 

We have seen that the optimal pricing policy for a monopolist dealing 
with two groups of consumers is to sell to the high-willingness-to-pay mar¬ 
ket at a high price and offer a reduced-quality product to the market with 
the lower willingness to pay. The point of the reduced-quality product is 
to dissuade those with a high willingness to pay from purchasing the lower 
priced good. 

The way the airlines implement this is to offer an “unrestricted fare” 
for business travel and a “restricted fare” for non-business travel. The 
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restricted fare often requires advanced purchase, a Saturday-night stayover, 
or other such impositions. The point of these impositions, of course, is to 
be able to discriminate between the high-demand business travelers and 
the more price sensitive individual travelers. By offering a “degraded” 
product—the restricted fares—the airlines can charge the customers who 
require flexible travel arrangements considerably more for their tickets. 

Such arrangements may well be socially useful; without the ability to 
price discriminate, a firm may decide that it is optimal to sell only to the 
high-demand markets. 

Another way that airlines price discriminate is with first-class and coach- 
class travel. First-class travelers pay substantially more for their tickets, 
but they receive an enhanced level of service: more space, better food, and 
more attention. Coach-class travelers, on the other hand, receive a lower 
level of service on all these dimensions. This sort of quality discrimination 
has been a feature of transportation services for hundreds of years. Wit¬ 
ness, for example, this commentary on railroad pricing by Emile Dupuit, a 
nineteenth century French economist: 


It is not because of the few thousand francs which would have to 
be spent to put a roof over the third-class carriage or to upholster the 
third-class seats that some company or other has open carriages with 
wooden benches ... What the company is trying to do is prevent the 
passengers who can pay the second-class fare from traveling third class; 
it hits the poor, not because it wants to hurt them, but to frighten the 
rich ... And it is again for the same reason that the companies, having 
proved almost cruel to the third-class passengers and mean to the 
second-class ones, become lavish in dealing with first-class customers. 
Having refused the poor what is necessary, they give the rich what is 
superfluous. 2 

The next time you fly coach class, perhaps it will be of some solace to 
know that rail travel in nineteenth century France was even more uncom¬ 
fortable! 


EXAMPLE: Prescription Drug Prices 

A month’s supply of the antidepressant Zoloft sells for $29.74 in Austria, 
$32.91 in Luxembourg, $40.97 in Mexico, and $64.67 in the United States. 
Why the difference? Drug makers, like other firms, charge what the market 


2 


Translation by R. B. Ekelund in “Price Discrimination and Product Differentiation in 
Economic Theory: An Early Analysis,” Quarterly Journal of Economics, 84 (1970), 


268-78. 
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will bear. Poorer countries can’t pay as much as richer ones, so drug prices 
tend to be lower. 

But that’s not the whole story. Bargaining power also differs dramati¬ 
cally from country to country. Canada, which has a national health plan, 
often has lower drug prices than the United States, where there is no cen¬ 
tralized provider of health care. 

It has been proposed that drug companies be forced to charge a single 
price worldwide. Leaving aside the thorny question of enforcement, we 
might well ask what the consequences of such a policy would be. Would 
the world overall end up with lower prices or higher prices? 

The answer depends on the relative size of the market. A drug for malaria 
would find most of its demand in poor countries. If forced to charge a single 
price, drug companies would likely sell such a drug at a low price. But a 
drug for diseases that afflicted those in wealthy countries would likely sell 
for a high price, making it too expensive for those in poorer areas. 

Typically, moving from price discrimination to a single-price regime will 
raise some prices and lower others, making some people better off and some 
people worse off. In some cases, a product may not be supplied at all to 
some markets if a seller is forced to apply uniform pricing. 


25.4 Third-Degree Price Discrimination 

Recall that this means that the monopolist sells to different people at dif¬ 
ferent prices, but every unit of the good sold to a given group is sold at the 
same price. Third-degree price discrimination is the most common form 
of price discrimination. Examples of this might be student discounts at 
the movies, or senior citizens’ discounts at the drugstore. How does the 
monopolist determine the optimal prices to charge in each market? 

Let us suppose that the monopolist is able to identify two groups of 
people and can sell an item to each group at a different price. We suppose 
that the consumers in each market are not able to resell the good. Let us 
use pi(yi) and £> 2 ( 2 / 2 ) to denote the inverse demand curves of groups 1 and 
2, respectively, and let 0 ( 2/1 + 2 / 2 ) be the cost of producing output. Then 
the profit-maximization problem facing the monopolist is 

max pi(yi)yi +£> 2 ( 222 ) 2/2 - c(y 1 + y 2 ). 
y 1,2/2 

The optimal solution must have 

MR± ( 2 / 1 ) = MC(yi + y 2 ) 

MR 2 (y 2 ) = MC(yi + y 2 ). 

That is, the marginal cost of producing an extra unit of output must be 
equal to the marginal revenue in each market. If the marginal revenue in 
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market 1 exceeded marginal cost, it would pay to expand output in market 
1, and similarly for market 2. Since marginal cost is the same in each 
market, this means of course that marginal revenue in each market must 
also be the same. Thus a good should bring the same increase in revenue 
whether it is sold in market 1 or in market 2. 

We can use the standard elasticity formula for marginal revenue and 
write the profit-maximization conditions as 


PiiVi) 


ki(yi)L 


MC(yi + y 2 ) 


P2{V2 ) 


MzhOI. 


MC(yi + y 2 ), 


where £i(?/i) and e 2 (y 2 ) represent the elasticities of demand in the respec¬ 
tive markets, evaluated at the profit-maximizing choices of output. 

Now note the following. If p\ > p 2 , then we must have 


1 \ei(yi)\ <1 \e 2 (y 2 ) |’ 

which in turn implies that 

1 1 

Mz/i)l > MW)!’ 

This means that 

|e 2 (2/2)| > |ei(?/i)|. 


Thus the market with the higher price must have the lower elasticity of 
demand. Upon reflection, this is quite sensible. An elastic demand is a 
price-sensitive demand. A firm that price discriminates will therefore set a 
low price for the price-sensitive group and a high price for the group that 
is relatively price insensitive. In this way it maximizes its overall profits. 

We suggested that senior citizens’ discounts and student discounts were 
good examples of third-degree price discrimination. Now we can see why 
they have discounts. It is likely that students and senior citizens are more 
sensitive to price than the average consumer and thus have more elastic 
demands for the relevant region of prices. Therefore a profit-maximizing 
firm will price discriminate in their favor. 


EXAMPLE: Linear Demand Curves 

Let us consider a problem where the firm faces two markets with linear 
demand curves, X\ = a — bpi and x 2 = c — dp 2 . Suppose for simplicity 
that marginal costs are zero. If the firm is allowed to price discriminate, 
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it will produce where marginal revenue equals zero in each market—at a 
price and output combination that is halfway down each demand curve, 
with outputs x* = a/2 and = c/2 and prices p\ = a/26 and p?; = c/2d. 

Suppose that the firm were forced to sell in both markets at the same 
price. Then it would face a demand curve of x = (a + c) — (6 + d)p and 
would produce halfway down this demand curve, resulting in an output of 
x* = (a + c)/2 and price of p* = (a + c)/2(b + d). Note that the total 
output is the same whether or not price discrimination is allowed. (This is 
a special feature of the linear demand curve and does not hold in general.) 

However, there is an important exception to this statement. We have 
assumed that when the monopolist chooses the optimal single price it will 
sell a positive amount of output in each market. It may very well happen 
that at the profit-maximizing price, the monopolist will sell output to only 
one of the markets, as illustrated in Figure 25.4. 



Price discrimination with linear demands. If the monop¬ 
olist can charge only one price, it will charge p\, and sell only 
to market 1. But if price discrimination is allowed, it will also 
sell at price p\ to market 2. 


Figure 

25.4 


Here we have two linear demand curves; since marginal cost is assumed to 
be zero, the monopolist will want to operate at a point where the elasticity 
of demand is —1, which we know to be halfway down the market demand 
curve. Thus the price p\ is a profit-maximizing price—lowering the price 
any further would reduce revenues in market 1. If the demand in market 2 
is very small, the monopolist may not want to lower its price any further in 
order to sell to this market: it will end up selling only to the larger market. 
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In this case, allowing price discrimination will unambiguously increase 
total output, since the monopolist will find it in its interest to sell to both 
markets if it can charge a different price in each one. 


EXAMPLE: Calculating Optimal Price Discrimination 

Suppose that a monopolist faces two markets with demand curves given by 


Di(pi) = 100 — pi 
D 2 (p 2 ) = 100 - 2p 2 . 

Assume that the monopolist’s marginal cost is constant at $20 a unit. If it 
can price discriminate, what price should it charge in each market in order 
to maximize profits? What if it can’t price discriminate? Then what price 
should it charge? 

To solve the price-discrimination problem, we first calculate the inverse 
demand functions: 

£1(2/1) = 100 - yi 
£ 2 ( 2 / 2 ) = 50 - 2 / 2 / 2 . 

Marginal revenue equals marginal cost in each market yields the two equa¬ 
tions: 

100 - 22/1 = 20 

50 - 2/2 = 20. 

Solving we have y\ = 40 and y 2 = 30. Substituting back into the inverse 
demand functions gives us the prices p\ = 60 and p 2 = 35. 

If the monopolist must charge the same price in each market, we first 
calculate the total demand: 

D(p) = Di(pi) + D 2 (p 2 ) = 200 - 3 p. 

The inverse demand curve is 


p(y ) = 


200 y 

T~ ~ 3' 


Marginal revenue equals marginal cost gives us 


200 

”3” 



which can be solved to give y* = 70 and p* = 431. 

In accord with the discussion in the previous section, it is important 
to check that this price generates non-negative demands in each market. 
However, it is easily checked that this is the case. 
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EXAMPLE: Price Discrimination in Academic Journals 

Most written scholarly communication takes place in academic journals. 
These journals are sold by subscription to libraries and to individual schol¬ 
ars. It is very common to see different subscription prices being charged to 
libraries and individuals. In general, we would expect that the demand by 
libraries would be much more inelastic than demand by individuals, and, 
just as economic analysis would predict, the prices for library subscriptions 
are typically much higher than the prices for individual subscriptions. Of¬ 
ten library subscriptions are 2 to 3 times more expensive than subscriptions 
to individuals. 

More recently, some publishers have begun to price discriminate by ge¬ 
ography. During 1984, when the U.S. dollar was at an all-time high as 
compared to the English pound, many British publishers began to charge 
different prices to U.S. subscribers than to European subscribers. It would 
be expected that the U.S. demand would be more inelastic. Since the dol¬ 
lar price of British journals was rather low due to the exchange rate, a 10 
percent increase in the U.S. price would result in a smaller percentage drop 
in demand than a similar increase in the British price. Thus, on grounds 
of profit maximization, it made sense for the British publishers to raise the 
prices of their journals to the group with the lower elasticity of demand— 
the U.S. subscribers. According to a 1984 study, North American libraries 
were charged an average of 67 percent more for their journals than U.K. 
libraries, and 34 percent more than anyone else in the world. 3 

Further evidence for price discrimination can be found by examining 
the pattern of price increases. According to a study by the University 
of Michigan Library, “... publishers have carefully considered their new 
pricing strategy. There seems to be a direct correlation ... between patterns 
of library usage and the magnitude of the pricing differential. The greater 
the use, the larger the differential.” 4 

By 1986 the exchange rate had turned in favor of the pound, and the 
dollar prices of the British journals had increased significantly. Along with 
the price increase came some serious resistance to the higher prices. The 
concluding sentences of the Michigan report are illustrative: “One expects 
that a vendor with a monopoly on a product will charge according to 
demand. What the campus as a customer must determine is whether it 
will continue to pay up to 114% more than its British counterparts for the 
identical product.” 


3 ttamaker, C. and Astle, D., “Recent Pricing Patterns in British Journal Publishing,” 
Library Acquisitions: Practice and Theory , 8, 4 (Spring 1984), 225—32. 

4 The study was conducted by Robert Houbeck for the University of Michigan Library, 
and published in Vol. 2, No. 1 of the University Library Update , April 1986. 
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25.5 Bundling 

Firms often choose to sell goods in bundles: packages of related goods 
offered for sale together. A noteworthy example is a bundle of software, 
sometimes known as a “software suite.” Such a bundle might consist of 
several different software tools—a word processor, a spreadsheet, and a 
presentation tool—that are sold together in one set. Another example is a 
magazine: this consists of a bundle of articles that could, in principle, be 
sold separately. Similarly, magazines are often sold via subscription—which 
is just a way of bundling separate issues together. 

Bundling can be due to cost savings: it is often less expensive to sell 
several articles stapled together than it is to sell each of them separately. 
Or it may be due to complementarities among the goods involved: software 
programs sold in bundles often work together more effectively than off-the- 
shelf programs. 

But there can also be reasons involving consumer behavior. Let’s con¬ 
sider a simple example. Suppose that there are two classes of consumers 
and two different software programs, a word processor and a spreadsheet. 
Type A consumers are willing to pay $120 for the word processor and $100 
for the spreadsheet. Type B consumers have the opposite preferences: they 
are willing to pay $120 for the spreadsheet and $100 for the word processor. 
This information is summarized in Table 25.1. 


Table Willingness to pay for software components. 

25.1 


Type of consumer 

Word processor 

Spreadsheet 

Type A consumers 

120 

100 

Type B consumers 

100 

120 


Suppose that you are selling these products. For simplicity, let us assume 
that the marginal cost is negligible so that you only want to maximize rev¬ 
enue. Furthermore, make the conservative assumption that the willingess 
to pay for the bundle consisting of the word processor and the spreadsheet 
is just the sum of the willingesses to pay for each component. 

Now consider the profits from two different marketing policies. First, 
suppose that you sell each item separately. The revenue maximizing policy 
is to set a price of $100 for each piece of software. If you do this, you will 
sell two copies of the word processor and two copies of the spreadsheet, 
and receive a total revenue of $400. 
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But what if you bundle the items together? In this case, you could sell 
each bundle for $220, and receive a net revenue of $440. The bundling 
strategy is clearly more attractive! 

What is going on in this example? Recall that when you sell an item 
to several different people, the price is determined by the purchaser who 
has the lowest willingess to pay. The more diverse the valuations of the 
individuals, the lower the price you have to charge to sell a given number 
of items. In this case bundling the word processor and the spreadsheet 
reduces the dispersion of willingess to pay—allowing the monopolist to set 
a higher price for the bundle of goods. 


EXAMPLE: Software Suites 

Microsoft, Lotus, and other software manufacturers have taken to bundling 
much of their applications software. For example, in 1993 Microsoft offered 
a spreadsheet, word processor, presentation tool, and database as the “Mi¬ 
crosoft Office” package at a suggested retail price of $750. (The discounted 
“street price” was about $450.) If bought separately, the individual soft¬ 
ware applications would total $1,565! Lotus offered its “Smart Suite” at 
essentially the same price; its separate components sold for a total of $1,730. 

According to an article by Steve Lohr in the October 15, 1993, New York 
Times, 50 percent of Microsoft’s applications software was sold in bundles, 
and generated revenue of over $1 billion a year. 

These software suites fit the bundling model well. Tastes for software are 
often very heterogeneous. Some people use a word processor every day and 
use a spreadsheet only occasionally. Other people have the reverse pattern 
of software use. If you wish to sell a spreadsheet to a large number of 
users, you have to sell it at a price that will be attractive to an occasional 
user. Similarly with the word processor: it is the willingness to pay of the 
marginal user that sets the market price. By bundling the two products 
together, the dispersion of willingnesses to pay is reduced and total profits 
can increase. 

This is not to say that bundling is the whole story in software suites; 
other phenomena are also at work. The individual components of the 
suites are guaranteed to work well together; they are complementary goods 
in this respect. Furthermore, the success of a piece of software tends to 
depend strongly on how many people use it, and bundling software helps 
to build market share. We will investigate this phenomenon of network 
externalities in a subsequent chapter. 


25.6 Two-Part Tariffs 

Consider the pricing problem facing the owners of an amusement park. 
They can set one price for tickets to get into the park and another price for 
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the rides. How should they set these two prices if they want to maximize 
profits? Note that the demand for access and the demand for rides are 
interrelated: the price that people are willing to pay to get into the park 
will depend on the price that they have to pay for the rides. This kind of 
two-part pricing scheme is known as a two-part tariff . 5 

Other applications of two-part tariffs abound: Polaroid sells its camera 
for one price and its film for another. People who are deciding whether 
or not to purchase the camera presumably consider the price of the film. 
A company that makes razor blades sells the razor for one price and the 
blades for another—again the price they set for the blades influences the 
demand for razors and vice versa. 

Let us consider how to solve this pricing problem in the context of the 
original example: the so-called Disneyland Dilemma. As usual we will 
make some simplifying assumptions. First, we assume that there is only 
one kind of ride in Disneyland. Second, we assume that people only desire 
to go to Disneyland for the rides. Finally, we assume that everyone has the 
same tastes for rides. 

In Figure 25.5 we have depicted the demand curve and the (constant) 
marginal cost curve for rides. As usual the demand curve slopes down—if 
Disney sets a high price for each ride, fewer rides will be taken. Suppose 
that they set a price of p*, as in Figure 25.5, that leads to a demand for 
x* rides. How much will they be able to charge for admission to the park, 
given that the rides cost p*l 

The total willingness to pay for x* rides is measured by the consumers’ 
surplus. Hence the most that the owners of the park can charge for admis¬ 
sion is the area labeled “consumer’s surplus” in Figure 25.5. The total prof¬ 
its to the monopolist is this area plus the profit on the rides, (p* — MC)x*. 

It is not hard to see that total profits are maximized when price equals 
marginal cost: we’ve seen before that this price gives the largest possible 
consumer plus producer surplus. Since the monopolist gets to charge people 
their consumers’ surplus, setting price equal to marginal cost and the entry 
fee to the resulting consumer’s surplus is the profit-maximizing policy. 

Indeed, this is the policy that Disneyland, and most other amusement 
parks follow. There is one price for admission, but then the attractions 
inside are free. It appears that the marginal cost of the rides is less than 
the transactions cost of collecting a separate payment for them. 


25.7 Monopolistic Competition 

We have described a monopolistic industry as being one in which there is a 
single large producer. But we’ve been somewhat vague about exactly what 


5 


See the classic article by Walter Oi, “A Disneyland Dilemma: Two-Part Tariffs for a 
Mickey Mouse Monopoly,” Quarterly Journal of Economics, 85 (1971), 77-96. 
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Disneyland Dilemma. If the owners of the park set a price 
of p*, then x* rides will be demanded. The consumers’ surplus 
measures the price that they can charge for admission to the 
park. The total profits of the firm are maximized when the 
owners set price equal to marginal cost. 


Figure 

25.5 


comprises an industry. One definition of an industry is that it consists of 
all firms that produce a given product. But then what do we mean by 
product? After all, there is only one firm that produces Coca-Cola—does 
that mean that this firm is a monopolist? 

Clearly the answer is no. The Coca-Cola firm still has to compete 
with other producers of soft drinks. We should really think of an indus¬ 
try as being the set of firms that produce products that are viewed as 
close substitutes by consumers. Each firm in the industry can produce a 
unique product—a unique brand name, say— but consumers view each of 
the brands as being substitutes to some degree. 

Even though a firm may have a legal monopoly on its trademarks, and 
brand names, so that other firms can’t produce exactly the same product, 
it is usually possible for other firms to produce similar products. From 
the viewpoint of a given firm, the production decisions of its competitors 
will be a very important consideration in deciding exactly how much it will 
produce and what price it can charge. 

Thus the demand curve facing a firm will usually depend on the out¬ 
put decisions and the prices charged by other firms that produce similar 
products. The slope of the demand curve facing the firm will depend on 
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how similar the other firms’ products are. If a large number of the firms 
in the industry produce identical products, then the demand curve facing 
any one of them will be essentially flat. Each firm must sell its product for 
whatever price the other firms are charging. Any firm that tried to raise 
its price above the prices of the other firms selling identical products would 
soon lose all of its customers. 

On the other hand, if one firm has the exclusive rights to sell a particular 
product, then it may be able to raise its price without losing all of its 
customers. Some, but not all, of its customers may switch to competitors’ 
products. Just how many customers switch depends on how similar the 
customers think the products are—that is, on the elasticity of the demand 
curve facing the firm. 

If a firm is making a profit selling a product in an industry, and other 
firms are not allowed to perfectly reproduce that product, they still may 
find it profitable to enter that industry and produce a similar but distinctive 
product. Economists refer to this phenomenon as product differentia¬ 
tion —each firm attempts to differentiate its product from the other firms 
in the industry. The more successful it is at differentiating its product from 
other firms selling similar products, the more monopoly power it has—that 
is, the less elastic is the demand curve for the product. For example, 
consider the soft drink industry. In this industry there are a number of 
firms producing similar, but not identical products. Each product has its 
following of consumers, and so has some degree of market power. 

An industry structure such as that described above shares elements of 
both competition and monopoly; it is therefore referred to as monopolistic 
competition. The industry structure is monopolistic in that each firm 
faces a downward-sloping demand curve for its product. It therefore has 
some market power in the sense that it can set its own price, rather than 
passively accept the market price as does a competitive firm. On the other 
hand the firms must compete for customers in terms of both price and the 
kinds of products they sell. Furthermore, there are no restrictions against 
new firms entering into a monopolistically competitive industry. In these 
aspects the industry is like a competitive industry. 

Monopolistic competition is probably the most prevalent form of industry 
structure. Unfortunately, it is also the most difficult form to analyze. The 
extreme cases of pure monopoly and pure competition are much simpler 
and can often be used as first approximations to more elaborate models of 
monopolistic competition. In a detailed model of a monopolistically com¬ 
petitive industry, much depends on the specific details of the products and 
technology, as well as on the nature of the strategic choices available to 
firms. It is unreasonable to model a monopolistically competitive industry 
in the abstract, as we have done with the simpler cases of pure competi¬ 
tion and pure monopoly. Rather, the institutional details of the particular 
industry under consideration must be examined. We will describe some 
methods that economists use to analyze strategic choice in the next two 
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chapters, but a detailed study of monopolistic competition will have to wait 
for more advanced courses. 

We can, however, describe an interesting feature of the free entry aspect 
of monopolistic competition. As more and more firms enter the industry 
for a particular kind of product, how would we expect the demand curve 
of an incumbent firm to change? First, we would expect the demand curve 
to shift inward since we would expect that at each price, it would sell fewer 
units of output as more firms enter the industry. Second, we would expect 
that the demand curve facing a given firm would become more elastic as 
more firms produced more and more similar products. Thus entry into an 
industry by new firms with similar products will tend to shift the demand 
curves facing existing firms to the left and make them flatter. 

If firms continue to enter the industry as long as they expect to make a 
profit, equilibrium must satisfy the following three conditions: 

1. Each firm is selling at a price and output combination on its demand 

curve. 

2. Each firm is maximizing its profits, given the demand curve facing it. 

3. Entry has forced the profits of each firm down to zero. 

These facts imply a very particular geometrical relationship between the 
demand curve and the average cost curve: the demand curve and the av¬ 
erage cost curve must be tangent to each other. 

The argument is illustrated in Figure 25.6. Fact 1 says that the output 
and price combination must be somewhere on the demand curve, and fact 
3 says that the output and price combination must also be on the average 
cost curve. Thus the operating position of the firm must be at a point that 
lies on both curves. Could the demand curve cross the average cost curve? 
No, because then there would be some point on the demand curve above 
the average cost curve—but this would be a point yielding positive profits. 6 
And by fact 2, the zero profit point is a profit maximum. 

Another way to see this is to examine what would happen if the firm 
depicted in Figure 25.6 charged any price other than the break-even price. 
At any other price, higher or lower, the firm would lose money, while at the 
break-even price, the firm makes zero profits. Thus the break-even price is 
the profit-maximizing price. 

There are two worthwhile observations about the monopolistically com¬ 
petitive equilibrium. First, although profits are zero, the situation is still 
Pareto inefficient. Profits have nothing to do with the efficiency question: 
when price is greater than marginal cost, there is an efficiency argument 
for expanding output. 


If p > c(y)/y, then simple algebra shows that py — c(y) > 0. 
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Figure 

25.6 



Monopolistic competition. In a monopolistically compet¬ 
itive equilibrium with zero profits, the demand curve and the 
average cost curve must be tangent. 


Second, it is clear that firms will typically be operating to the left of 
the level of output where average cost is minimized. This has sometimes 
been interpreted as saying that in monopolistic competition there is “excess 
capacity.” If there were fewer firms, each could operate at a more efficient 
scale of operation, which would be better for consumers. However, if there 
were fewer firms there would also be less product variety, and this would 
tend to make consumers worse off. Which of these effects dominates is a 
difficult question to answer. 


25.8 A Location Model of Product Differentiation 

In Atlantic City there is a boardwalk that stretches along the beach. Some 
ice cream vendors with pushcarts want to sell ice cream on the boardwalk. 
If one vendor is going to be given the concession to sell ice cream on the 
boardwalk, where should he locate? 7 

Suppose that consumers are distributed evenly along the beach. From a 
social point of view, it makes sense to locate the ice cream vendor so that 


7 The discussion here is based on the classic model of Harold Hotelling, “Stability in 
Competition,” Economic Journal, March 1929. 
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the total distance walked by all the consumers is minimized. It is not hard 
to see that this optimal location is halfway along the boardwalk. 

Now suppose that two ice cream vendors are allowed. Suppose that we 
fix the price that they are able to charge for their ice cream and just ask 
where they should locate in order to minimize the total distance walked. If 
each consumer walks to the ice cream vendor nearest him, we should put 
one vendor a quarter of the way along the boardwalk and one vendor three- 
quarters of the way along the boardwalk. The consumer halfway along the 
boardwalk will be indifferent between the two ice cream vendors; each has 
a market share of one-half of the consumers. (See Figure 25.7A.) 

But do the ice cream vendors have an incentive to stay in these locations? 
Put yourself in the position of vendor L. If you move a little bit to the 
right, you will steal some of the other vendor’s customers and you won’t 
lose any of your own. By moving to the right, you will still be the closest 
vendor to all the customers to your left and you will still be closer to the 
customers on your right. You will therefore increase your market share and 
your profits. 


L 

♦ 


R L R 

• - 1 • 


Market share Market share 
of vendor L of vendor R 


Market share Market share 
of vendor L of vendor R 


A 


B 


Competition in location. Panel A shows the socially optimal 
location pattern; L locates one-quarter of the way along the line 
and R locates three-quarters of the way along. But each vendor 
will find it in its private interest to move toward the middle. 
The only equilibrium location is for both vendors to be in the 
middle, as shown in Panel B. 


Figure 

25.7 


But vendor R can reason the same way—by moving to the left, he will 
steal some of the other vendor’s customers and not lose any of his own! 
This shows that the socially optimal location patterns are not an equi- 
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librium. The only equilibrium is for both vendors to sell in the middle 
of the boardwalk, as shown in Figure 25.7B. In this case, competition for 
customers has resulted in an inefficient location pattern. 

The boardwalk model can serve as a metaphor for other sorts of product- 
differentiation problems. Instead of the boardwalk, think of the choice of 
music varieties by two radio stations. At one extreme we have classical 
music and at the other we have heavy metal rock. Each listener chooses 
the station that appeals more to his tastes. If the classical station plays 
music that is a bit more toward the middle of the taste spectrum, it won’t 
lose the classical clients, but it will gain a few of the middlebrow listeners. 
If the rock station moves a bit toward the middle, it won’t lose any of its 
rock lovers but will get a few of the middlebrow listeners. In equilibrium, 
both stations play the same sort of music and the people with more extreme 
tastes are unhappy with both of them! 


25.9 Product Differentiation 

The boardwalk model suggest that monopolistic competition will result in 
too little product differentiation: each firm will want to make its product 
similar to that of the other firm in order to steal the other firm’s customers. 
Indeed, we can think of markets in which there is too much imitation 
relative to what seems to be optimal. 

However, it doesn’t always work this way. Suppose that the boardwalk 
is very long. Then each ice cream vendor would be perfectly happy sitting 
near each end of the boardwalk. If their market areas don’t overlap, nothing 
is to be gained from moving closer to the middle of the boardwalk. In this 
case, neither monopolist has an incentive to imitate the other, and the 
products are about as different as they can get. 

It is possible to produce models of monopolistic competition where there 
is excessive product differentiation. In such models, each firm attempts 
to make consumers think that its product is different from the products 
of its competitors so as to create some degree of market power. If the 
firms succeed in convincing the consumers that their product has no close 
substitutes, they will be able to charge a higher price for it than they would 
otherwise be able to do. 

This leads each producer to invest heavily in creating a distinctive brand 
identity. Laundry soap, for example, is a pretty standardized commodity. 
Yet manufacturers invest huge amounts of money in advertisements that 
claim cleaner clothes, better smell, a better marriage, and and a generally 
happier life if you choose their brand rather than a competitor’s. This 
“product positioning” is much like the ice cream vendors locating far away 
from each other in order to avoid head-to-head competition. 

There are critics who have argued that such excessive investment in 
product positioning is wasteful. Perhaps this is true in some cases, but then 
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again, “excessive variety” may simpley be a consequence of encouraging 
firms to provide consumers with a variety of products from which to choose. 


25.10 More Vendors 

We have shown that if there are two vendors whose market areas overlap, 
and each seller sells the same price, they will both end up located at the 
“middle” of the boardwalk. What happens if there are more than two 
vendors who compete in their location? 

The next easiest case is that of three vendors. This case gives rise to 
a rather peculiar outcome: there may be no equilibrium location pattern! 
To see this, look at Figure 25.8. If there are three vendors located on the 
boardwalk, there must be one located between the other two. As before, it 
pays each of the “outside” vendors to move towards the middle vendor since 
they can steal some of its customers without losing any of their own. But 
if they get too close to the other vendor, it pays it to jump immediately to 
the right of its right-hand competitor or immediately to the left of its left- 
hand competitor to steal its market. No matter what the location pattern, 
it pays someone to move! 



No equilibrium. There is no pure strategy equilibrium in the 
Hotelling model with 3 firms since for any configuration, at least 
one firm wants to change location. 


Luckily, this “perverse” result only holds in the case of three competitors. 
If there are four or more competitors, an equilibrium location pattern will 
generally emerge. 


Figure 

25.8 
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Summary 

1. There will typically be an incentive for a monopolist to engage in price 
discrimination of some sort. 

2. Perfect price discrimination involves charging each customer a different 
take-it-or-leave-it price. This will result in an efficient level of output. 

3. If a firm can charge different prices in two different markets, it will tend 
to charge the lower price in the market with the more elastic demand. 

4. If a firm can set a two-part tariff, and consumers are identical, then it 
will generally want to set price equal to marginal cost and make all of its 
profits from the entry fee. 

5. The industry structure known as monopolistic competition refers to a 
situation in which there is product differentiation, so each firm has some 
degree of monopoly power, but there is also free entry so that profits are 
driven to zero. 

6. Monopolistic competition can result in too much or too little product 
differentiation in general. 


REVIEW QUESTIONS 


1. Will a monopoly ever provide a Pareto efficient level of output on its 
own? 

2. Suppose that a monopolist sells to two groups that have constant elas¬ 
ticity demand curves, with elasticity e± and C 2 - The marginal cost of pro¬ 
duction is constant at c. What price is charged to each group? 

3. Suppose that the amusement park owner can practice perfect first-degree 
price discrimination by charging a different price for each ride. Assume that 
all rides have zero marginal cost and all consumers have the same tastes. 
Will the monopolist do better charging for rides and setting a zero price 
for admission, or better by charging for admission and setting a zero price 
for rides? 

4. Disneyland also offers a discount on admissions to residents of Southern 
California. (You show them your zip code at the gate.) What kind of 
price discrimination is this? What does this imply about the elasticity of 
demand for Disney attractions by Southern Californians? 


CHAPTER 



FACTOR 

MARKETS 


In our examination of factor demands in Chapter 19 we only considered 
the case of a firm that faced a competitive output market and a competi¬ 
tive factor market. Now that we have studied monopoly behavior, we can 
examine some alternative specifications of factor demand behavior. For ex¬ 
ample, what happens to factor demands if a firm behaves as a monopolist 
in its output market? Or what happens to factor demands if a firm is the 
sole demander for the use of some factors? We investigate these questions 
and some related questions in this chapter. 


26.1 Monopoly in the Output Market 

When a firm determines its profit-maximizing demand for a factor, it will 
always want to choose a quantity such that the marginal revenue from 
hiring a little more of that factor just equals the marginal cost of doing 
so. This follows from the standard logic: if the marginal revenue of some 
action didn’t equal the marginal cost of that action, then it would pay for 
the firm to change the action. 
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This general rule takes various special forms depending on our assump¬ 
tions about the environment in which the firm operates. For example, 
suppose that the firm has a monopoly for its output. For simplicity we will 
suppose that there is only one factor of production and write the produc¬ 
tion function as y — /(*)■ The revenue that the firm receives depends on 
its production of output so we write R{y) = p(y)y , where p(y) is the inverse 
demand function. Let us see how a marginal increase in the amount of the 
input affects the revenues of the firm. 

Suppose that we increase the amount of the input a little bit, Ax. This 
will result in a small increase in output, Ay. The ratio of the increase in 
output to the increase in the input is the marginal product of the factor: 


= A y = /(x + Ax) - f(x) 
Ax Ax 


(26.1) 


This increase in output will cause revenue to change. The change in 
revenue is called the marginal revenue. 


MR y = 


A R 
Ay 


R(y + Ay) - R(y) 
Ay 


(26.2) 


The effect on revenue due to the marginal increase in the input is called 
the marginal revenue product. Examining equations (26.1) and (26.2) 
we see that it is given by 


MRP X 


A R 
Ax 


AR Ay 
Ay Ax 


= MRy x MP X . 


We can use our standard expression for marginal revenue to write this as 

A v 

MRP X = p(y) + -A-y MP X 
= p{y) 1 + - MP X 


i + i 

e 


= p(y ) 


l - 


l 


MP X . 


The first expression is the usual expression for marginal revenue. The 
second and third expressions use the elasticity form of marginal revenue, 
which was discussed in Chapter 15. 

Now it is easy to see how this generalizes the competitive case we ex¬ 
amined earlier in Chapter 19. The elasticity of the demand curve facing 
an individual firm in a competitive market is infinite; consequently the 
marginal revenue for a competitive firm is just equal to price. Hence the 
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“marginal revenue product” of an input for a firm in a competitive market 
is just the value of the marginal product of that input, pMP x . 

How does the marginal revenue product (in the case of a monopoly) 
compare to the value of the marginal product? Since the demand curve 
has a negative slope, we see that the marginal revenue product will always 
be less than the value of the marginal product: 


MRP X = p 



MP X < pMP x . 


As long as the demand function is not perfectly elastic, the MRP X will be 
strictly less than pMP x . This means that at any level of employment of 
the factor, the marginal value of an additional unit is less for a monopolist 
than for a competitive firm. In the rest of this section we will assume that 
we are dealing with this case—the case where the monopolist actually has 
some monopoly power. 

At first encounter this statement seems paradoxical since a monopolist 
makes higher profits than a competitive firm. In this sense the total factor 
input is “worth more” to a monopolist than to a competitive firm. 

The resolution of this “paradox” is to note the difference between total 
value and marginal value. The total amount employed of the factor is 
indeed worth more to the monopolist than to the competitive firm since 
the monopolist will make more profits from the factor than the competitive 
firm. However, at a given level of output an increase in the employment 
of the factor will increase output and reduce the price that a monopolist 
is able to charge. But an increase in a competitive firm’s output will not 
change the price it can charge. Thus on the margin, a small increase in 
the employment of the factor is worth less to the monopolist than to the 
competitive firm. 

Since increases in the factor employment are worth less to a monopo¬ 
list than to a competitive firm on the margin in the short run, it makes 
sense that the monopolist would usually want to employ less of the input. 
Indeed this is generally true: the monopolist increases its profits by reduc¬ 
ing its output, and so it will usually hire lower amounts of inputs than a 
competitive firm. 

In order to determine how much of the factor a firm employs, we have 
to compare the marginal revenue of an additional unit of the factor to the 
marginal cost of hiring that factor. Let us assume that the firm operates 
in a competitive factor market, so that it can hire as much of the factor as 
it wants at a constant price of w. In this case, the competitive firm wants 
to hire x c units of the factor, where 


pMP(x c ) = w. 

The monopolist, on the other hand, wants to hire x m units of the factor, 
where 


MRP(x m ) = w. 
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Figure 

26.1 


We have illustrated this in Figure 26.1. Since MRP(x) < pMP(x ), the 
point where MRP(x m ) = w will always be to the left of the point where 
pMP(x c ) = w. Hence the monopolist will hire less than the competitive 
firm. 



Factor demand by a monopolist. Since the marginal rev¬ 
enue product curve (MRP) lies beneath the curve measuring the 
value of the marginal product (pMP), the factor demand by a 
monopolist must be less than the factor demand by the same 
firm if it behaves competitively. 


26.2 Monopsony 

In a monopoly there is a single seller of a commodity. In a monopsony 
there is a single buyer. The analysis of a monopsonist is similar to that of 
a monopolist. For simplicity, we suppose that the buyer produces output 
that will be sold in a competitive market. 

As above, we will suppose that the firm produces output using a single 
factor according to the production function y = f{x). However, unlike the 
discussion above, we suppose that the firm dominates the factor market in 
which it operates and recognizes the amount of the factor that it demands 
will influence the price that it has to pay for this factor. 









MONOPSONY 489 


We summarize this relationship by the (inverse) supply curve w(x). The 
interpretation of this function is that if the firm wants to hire x units of the 
factor it must pay a price of w(x). We assume that w[x) is an increasing 
function: the more of the x-factor the firm wants to employ, the higher 
must be the factor price it offers. 

A firm in a competitive factor market by definition faces a flat factor 
supply curve: it can hire as much as it wants at the going factor price. 
A monopsonist faces an upward-sloping factor supply curve: the more it 
wants to hire, the higher a factor price it must offer. A firm in a competitive 
factor market is a price taker. A monopsonist is a price maker. 

The profit-maximization problem facing the monopsonist is 

max pf(x) — w(x)x. 

X 

The condition for profit maximization is that the marginal revenue from 
hiring an extra unit of the factor should equal the marginal cost of that 
unit. Since we have assumed a competitive output market the marginal 
revenue is simply pMP x . What about the marginal cost? 

The total change in costs from hiring As more of the factor will be 

Ac = w Ax + xAw, 


so that the change in costs per unit change in As is 


Ac 

As 


» rsy A w 

= MC X = w + ——s. 

Ax 


The interpretation of this expression is similar to the interpretation of the 
marginal revenue expression: when the firm increases its employment of the 
factor it has to pay wAx more in payment to the factor. But the increased 
demand for the factor will push the factor price up by Ait;, and the firm 
has to pay this higher price on all of the units it was previously employing. 

We can also write the marginal cost of hiring additional units of the 
factor as 


MC X = w 


x Aw 
w Ax 


1 + 


1 

V 


where if is the supply elasticity of the factor. Since supply curves typically 
slope upward, if will be a positive number. If the supply curve is perfectly 
elastic, so that if is infinite, this reduces to the case of a firm facing a 
competitive factor market. Note the similarity of these observations with 
the analogous case of a monopolist. 

Let’s analyze the case of a monopsonist facing a linear supply curve for 
the factor. The inverse supply curve has the form 


w(x) = a + bx , 
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Figure 

26.2 


so that total costs have the form 

C(x ) = w(x)x = ax + bx 2 , 

and thus the marginal cost of an additional unit of the input is 

MC X ( x) = a + 2 bx. 

The construction of the monopsony solution is given in Figure 26.2. We 
find the position where the value of the marginal product equals marginal 
cost to determine x* and then see what the factor price must be at that 
point. 



Monopsony. The firm operates where the marginal revenue 
from hiring an extra unit of the factor equals the marginal cost 
of that extra unit. 


Since the marginal cost of hiring an extra unit of the factor exceeds the 
factor price, the factor price will be lower than if the firm had faced a 
competitive factor market. Too little of the factor will be hired relative to 
the competitive market. Just as in the case of the monopoly, a monopsonist 
operates at a Pareto inefficient point. But the inefficiency now lies in the 
factor market rather than in the output market. 
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EXAMPLE: The Minimum Wage 

Suppose that the labor market is competitive and that the government 
sets a minimum wage that is higher than the prevailing equilibrium wage. 
Since demand equals supply at the equilibrium wage, the supply of labor 
will exceed the demand for labor at the higher minimum wage. This is 
depicted in Figure 26.3A. 




Minimum wage. Panel A shows the effect of a minimum wage 
in a competitive labor market. At the competitive wage, w c , em¬ 
ployment would be L c . At the minimum wage, w, employment 
is only L mw . Panel B shows the effect of a minimum wage in 
a monopsonized labor market. Under monopsony, the wage is 
w m and employment is L m , which is less than the employment 
in the competitive labor market. If the minimum wage is set to 
w c , employment will increase to L c . 


Figure 

26.3 


Things are very different if the labor market is dominated by a monop- 
sonist. In this case, it is possible that imposing a minimum wage may 
actually increase employment. This is depicted in Figure 26.3B. If the 
government sets the minimum wage equal to the wage that would prevail 
in a competitive market, the “monopsonist” now perceives that it can hire 
workers at a constant wage of w c . Since the wage rate it faces is now in¬ 
dependent of how many workers it hires, it will hire until the value of the 
marginal product equals w c . That is, it will hire just as many workers as 
if it faced a competitive labor market. 

Setting a wage floor for a monopsonist is just like setting a price ceiling 
for a monopolist; each policy makes the firm behave as though it faced a 
competitive market. 
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26.3 Upstream and Downstream Monopolies 

We have now examined two cases involving imperfect competition and fac¬ 
tor markets: the case of a firm with a monopoly in the output market but 
facing a competitive factor market, and the case of a firm with a com¬ 
petitive output market that faces a monopolized factor market. Other 
variations are possible. The firm could face a monopoly seller in its factor 
market for example. Or it could face a monopsony buyer in its output 
market. It doesn’t make much sense to plod through each possible case; 
they quickly become repetitive. However, we will examine one interesting 
market structure in which a monopoly produces output that is used as a 
factor of production by another monopolist. 

Suppose then that one monopolist produces output x at a constant mar¬ 
ginal cost of c. We call this monopolist the upstream monopolist. It 
sells the x-factor to another monopolist, the downstream monopolist 
at a price of k. The downstream monopolist uses the x-factor to produce 
output y according to the production function y = f(x). This output is 
then sold in a monopolist market in which the inverse demand curve is 
p(y). For purposes of this example, we consider a linear inverse demand 
curve p(y) = a — by. 

To make things simple, think of the production function as just being 
y = x, so that for each unit of the x-input, the monopolist can produce one 
unit of the y-output. We further suppose that the downstream monopolist 
has no costs of production other than the unit price k that it must pay to 
the upstream monopolist. 

In order to see how this market works, start with the downstream mo¬ 
nopolist. Its profit-maximization problem is 

max p(y)y - ky = [a - by]y - ky. 
v 

Setting marginal revenue equal to marginal cost, we have 


a — 2 by = k , 


which implies that 

a — k 


Since the monopolist demands one unit of the x-input for each y-output 
that it produces, this expression also determines the factor demand function 


x = 


a — k 
2b 


(26.3) 


This function tells us the relationship between the factor price k and the 
amount of the factor that the downstream monopolist will demand. 
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Turn now to the problem of the upstream monopolist. Presumably it 
understands this process and can determine how much of the rr-good it will 
sell if it sets various prices k: this is simply the factor demand function 
given in equation (26.3). The upstream monopolist wants to choose x to 
maximize its profit. 

We can determine this level easily enough. Solving equation (26.3) for k 
as a function of x we have 

k = a — 2 bx. 

The marginal revenue associated with this factor demand function is 


MR = a — 4 bx. 

Setting marginal revenue equal to marginal cost we have 

a — 4 bx = c, 


or 

a — c 


Since the production function is simply y = x, this also gives us the total 
amount of the final product that is produced: 

y= a -^- ( 26 - 4 ) 

It is of interest to compare this to the amount that would be produced 
by a single integrated monopolist. Suppose that the upstream and the 
downstream firm merged so that we had one monopolist who faced an 
output inverse demand function p = a — by and faced a constant marginal 
cost of c per unit produced. The marginal revenue equals marginal cost 
equation is 

a — 2 by = c, 

which implies that the profit-maximizing output is 


y = 


a — c 
2b 


(26.5) 


Comparing equation (26.4) to equation (26.5) we see that the integrated 
monopolist produces twice as much output as the nonintegrated monopo¬ 
lists. 

This is depicted in Figure 26.4. The final demand curve facing the down¬ 
stream monopolist p(y), and the marginal revenue curve associated with 
this demand function is itself the demand function facing the upstream mo¬ 
nopolist. The marginal revenue curve associated with this demand function 
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Figure 

26.4 



Upstream and downstream monopoly. The downstream 
monopolist faces the (inverse) demand curve p{y). The mar¬ 
ginal revenue associated with this demand curve is MRu(y). 
This in turn is the demand curve facing the upstream monop¬ 
olist, and the associated marginal revenue curve is MRjj(y). 
The integrated monopolist produces at y*\ the nonintegrated 
monopolist produces at y* m . 


is therefore four times as steep as the final demand curve—which is why 
the output in this market is half what it would be in the integrated market. 

Of course the fact that the final marginal revenue curve is exactly four 
times as steep is particular to the linear demand case. However, it is not 
hard to see that an integrated monopolist will always produce more than an 
upstream-downstream pair of monopolists. In the latter case the upstream 
monopolist raises its price above its marginal cost and then the downstream 
monopolist raises its price above this already marked-up cost. There is a 
double markup. The price is not only too high from a social point of view, 
it is too high from the viewpoint of maximizing total monopoly profits! If 
the two monopolists merged, price would go down and profits would go up. 


Summary 

1. A profit-maximizing firm always wants to set the marginal revenue of 
each action it takes equal to the marginal cost of that action. 


2. In the case of a monopolist, the marginal revenue associated with an 
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increase in the employment of a factor is called the marginal revenue prod¬ 
uct. 

3. For a monopolist, the marginal revenue product will always be smaller 
than the value of the marginal product due to the fact that the marginal 
revenue from increasing output is always less than price. 

4. Just as a monopoly consists of a market with a single seller, a monopsony 
consists of a market with a single buyer. 

5. For a monopsonist the marginal cost curve associated with a factor will 
be steeper than the supply curve of that factor. 

6. Hence a monopsonist will hire an inefficiently small amount of the factor 
of production. 

7. If an upstream monopolist sells a factor to a downstream monopolist, 
then the final price of output will be too high due to the double markup 
phenomenon. 


REVIEW QUESTIONS 


1. We saw that a monopolist never produced where the demand for output 
was inelastic. Will a monopsonist produce where a factor is inelastically 
supplied? 

2. In our example of the minimum wage, what would happen if the labor 
market was dominated by a monopsonist and the government set a wage 
that was above the competitive wage? 

3. In our examination of the upstream and downstream monopolists we de¬ 
rived expressions for the total output produced. What are the appropriate 
expressions for the equilibrium prices, p and kl 


APPENDIX 

We can calculate marginal revenue product by using the chain rule. Let y = f(x) 
be the production function and p(y) be the inverse demand function. Revenue 
as a function of the factor employment is just 


R{x) =p{f(x))f{x). 
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Differentiating this expression with respect to x we have 
dR ^ = p(y)f'{x) + f{x)p'{y)f'{x) 

= \p{y) + p (y)y\f (*) 

= MR x MP. 

Let us examine the behavior of a firm that is a competitor in its output market 
and a monopsonist in its factor market. Letting w(x) be the inverse factor supply 
function, the profit-maximization problem is 

max pf(x) — w(x)x. 


Differentiating with respect to x, we have 


Pf'(x) 


w(x) + w'(x)x 


w(x) 


1 x dw 
w dx. 


w(x) 



Since the factor supply curve slopes upward, the right-hand side of this expression 
will be larger than w. Hence the monopsonist will choose to employ less of the 
factor than would a firm that behaves competitively in the factor market. 







CHAPTER 



OLIGOPOLY 


We have now investigated two important forms of market structure: pure 
competition, where there are typically many small competitors, and pure 
monopoly, where there is only one large firm in the market. However, 
much of the world lies between these two extremes. Often there are a 
number of competitors in the market, but not so many as to regard each 
of them as having a negligible effect on price. This is the situation known 
as oligopoly. 

The model of monopolistic competition described in Chapter 24 is a 
special form of oligopoly that emphasizes issues of product differentiation 
and entry. However, the models of oligopoly that we will study in this 
chapter are more concerned with the strategic interactions that arise in an 
industry with a small number of firms. 

There are several models that are relevant since there are several different 
ways for firms to behave in an oligopolistic environment. It is unreason¬ 
able to expect one grand model since many different behavior patterns can 
be observed in the real world. What we want is a guide to some of the 
possible patterns of behavior and some indication of what factors might be 
important in deciding when the various models are applicable. 
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For simplicity, we will usually restrict ourselves to the case of two firms; 
this is called a situation of duopoly. The duopoly case allows us to cap¬ 
ture many of the important features of firms engaged in strategic interaction 
without the notational complications involved in models with a larger num¬ 
ber of firms. Also, we will limit ourselves to investigation of cases in which 
each firm is producing an identical product. This allows us to avoid the 
problems of product differentiation and focus only on strategic interactions. 


27.1 Choosing a Strategy 

If there are two firms in the market and they are producing a homogeneous 
product, then there are four variables of interest: the price that each firm 
charges and the quantities that each firm produces. 

When one firm decides about its choices for prices and quantities it may 
already know the choices made by the other firm. If one firm gets to set its 
price before the other firm, we call it the price leader and the other firm 
the price follower. Similarly, one firm may get to choose its quantity first, 
in which case it is a quantity leader and the other is a quantity follower. 
The strategic interactions in these cases form a sequential game. 1 

On the other hand, it may be that when one firm makes its choices it 
doesn’t know the choices made by the other firm. In this case, it has to 
guess about the other firm’s choice in order to make a sensible decision 
itself. This is a simultaneous game. Again there are two possibilities: 
the firms could each simultaneously choose prices or each simultaneously 
choose quantities. 

This classification scheme gives us four possibilities: quantity leadership, 
price leadership, simultaneous quantity setting, and simultaneous price set¬ 
ting. Each of these types of interaction gives rise to a different set of 
strategic issues. 

There is also another possible form of interaction that we will examine. 
Instead of the firms competing against each other in one form or another 
they may be able to collude. In this case the two firms can jointly agree 
to set prices and quantities that maximize the sum of their profits. This 
sort of collusion is called a cooperative game. 


EXAMPLE: Pricing Matching 

It is common to see advertisements where the vendor offers to “meet or 
beat” any price. These are generally considered to be a sign of intensely 


i 


We will examine game theory in more detail in the next chapter, but it seems appro¬ 
priate to introduce these specific examples here. 
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competitive market. However, such offers can also be used as a way to 
dampen competition. 

Suppose there are two tire stores, East Side Tires and West Side Tires, 
that are advertising the same brand tire for $50. 

If East Side Tires cuts its advertised price to $45 while the West Side 
price stays at $50, we would expect that some of those customers on the 
west side of town would be willing to travel a few extra minutes in order 
to save $5. East Side Tires would then sell more tires at a lower price. If 
the increase in sales was large enough to overcome the price reduction, its 
profits would increase. 

That, in a nutshell, is the basic logic of competition: if customers are 
sufficiently sensitive to price, then a seller that cuts its price enjoys a surge 
in sales and an increase in profit. 

But instead of actually cutting its price, suppose instead that West Side 
Tires continued to charge $50 but added a promise to match any lower 
price. What happens now if East Side cuts its advertised price? 

In this case, those who find West Side Tires more convenient can just 
bring in the East Side ad and get the discounted price. Then, East Side 
Tires attracts no new customers from its price cut. In fact, it loses revenue 
since it sells essentially the same number of tires at a lower price. 

The moral: a vendor that offers a low-price guarantee takes away much 
of its competitors’ motivation for cutting prices. 


27.2 Quantity Leadership 


In the case of quantity leadership, one firm makes a choice before the other 
firm. This is sometimes called the Stackelberg model in honor of the 
first economist who systematically studied leader-follower interactions. 2 

The Stackelberg model is often used to describe industries in which there 
is a dominant firm, or a natural leader. For example, IBM is often con¬ 
sidered to be a dominant firm in the computer industry. A commonly 
observed pattern of behavior is for smaller firms in the computer industry 
to wait for IBM’s announcements of new products and then adjust their 
own product decisions accordingly. In this case we might want to model 
the computer industry with IBM playing the role of a Stackelberg leader, 
and the other firms in the industry being Stackelberg followers. 

Let us turn now to the details of the theoretical model. Suppose that 
firm 1 is the leader and that it chooses to produce a quantity y\. Firm 2 
responds by choosing a quantity y 2 ■ Each firm knows that the equilibrium 
price in the market depends on the total output produced. We use the 


2 


Heinrich von Stackelberg was a German economist who published his influential work 
on market organization, Marktform und Gleichgewicht , in 1934. 
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inverse demand function p(Y) to indicate the equilibrium price as a function 
of industry output, Y = y 1 + 2/2- 

What output should the leader choose to maximize its profits? The 
answer depends on how the leader thinks that the follower will react to its 
choice. Presumably the leader should expect that the follower will attempt 
to maximize profits as well, given the choice made by the leader. In order 
for the leader to make a sensible decision about its own production, it has 
to consider the follower’s profit-maximization problem. 


The Follower's Problem 

We assume that the follower wants to maximize its profits 
max p(y 1 + y 2 )y 2 ~ c 2 (y 2 ). 

V2 

The follower’s profit depends on the output choice of the leader, but from 
the viewpoint of the follower the leader’s output is predetermined -the 
production by the leader has already been made, and the follower simply 
views it as a constant. 

The follower wants to choose an output level such that marginal revenue 
equals marginal cost: 


A p 

MR 2 = p{yi + y 2 ) + —A 2/2 = MC 2 . 

A3/2 

The marginal revenue has the usual interpretation. When the follower 
increases its output, it increases its revenue by selling more output at the 
market price. But it also pushes the price down by Ap, and this lowers its 
profits on all the units that were previously sold at the higher price. 

The important thing to observe is that the profit-maximizing choice of 
the follower will depend on the choice made by the leader. We write this 
relationship as 

2/2 = /2 (?/ i ) - 

The function f2(1/1) tells us the profit-maximizing output of the follower 
as a function of the leader’s choice. This function is called the reaction 
function since it tells us how the follower will react to the leader’s choice 
of output. 

Let’s derive a reaction curve in the simple case of linear demand. In this 
case the (inverse) demand function takes the form p(yi +y 2 ) = a—b(yi+y 2 ). 
For convenience we’ll take costs to be zero. 

Then the profit function for firm 2 is 


772(2/1,2/2) = [a - 6(2/1 + 2/2)]2/2 
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or 

712(2/1,2/2) = ay 2 - byiy 2 - by 2 . 

We can use this expression to draw the isoprofit lines in Figure 27.1. 
These are lines depicting those combinations of y\ and y 2 that yield a 
constant level of profit to firm 2. That is, the isoprofit lines are comprised 
of all points (2/1,2/2) that satisfy equations of the form 


ayi - byiy 2 - by\ = tt 2 . 


Note that profits to firm 2 will increase as we move to isoprofit lines that 
are further to the left. This is true since if we fix the output of firm 2 at 
some level, firm 2’s profits will increase as firm l’s output decreases. Firm 2 
will make its maximum possible profits when it is a monopolist; that is, 
when firm 1 chooses to produce zero units of output. 



Derivation of a reaction curve. This reaction curve gives 
the profit-maximizing output for the follower, firm 2, for each 
output choice of the leader, firm 1. For each choice of y\ the 
follower chooses the output level f 2 {yi) associated with the iso- 
profit line farthest to the left. 


Figure 

27.1 


For each possible choice of firm l’s output, firm 2 wants to choose its own 
output to make its profits as large as possible. This means that for each 
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choice of y \, firm 2 will pick the value of y 2 that puts it on the isoprofit 
line furthest to the left, as illustrated in Figure 27.1. This point will satisfy 
the usual sort of tangency condition: the slope of the isoprofit line must 
be vertical at the optimal choice. The locus of these tangencies describes 
firm 2’s reaction curve, ^" 2 ( 2 / 1 )- 

To see this result algebraically, we need an expression for the marginal 
revenue associated with the profit function for firm 2. It turns out that 
this expression is given by 


MR 2 (y x ,y 2 ) =a-by 1 - 2 by 2 . 


(This is easy to derive using calculus. If you don’t know calculus, you’ll 
just have to take this statement on faith.) Setting the marginal revenue 
equal to marginal cost, which is zero in this example, we have 


a — byi — 2 by 2 = 0, 


which we can solve to derive firm 2’s reaction curve: 

a - by 1 
V2= ^^- 

This reaction curve is the straight line depicted in Figure 27.1. 


The Leader's Problem 

We have now examined how the follower will choose its output given the 
choice of the leader. We turn now to the leader’s profit-maximization 
problem. 

Presumably, the leader is also aware that its actions influence the output 
choice of the follower. This relationship is summarized by the reaction 
function f 2 (yi). Hence when making its output choice it should recognize 
the influence that it exerts on the follower. 

The profit-maximization problem for the leader therefore becomes 

max p(y 1 + y 2 )y x - c x (yi) 
y 1 

such that y 2 = f 2 {y x ). 

Substituting the second equation into the first gives us 
max p[y x + f 2 (yi)]yi - ci(yi). 

Vl 

Note that the leader recognizes that when it chooses output y x , the to¬ 
tal output produced will be y x + f 2 (y x ): its own output plus the output 
produced by the follower. 
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When the leader contemplates changing its output it has to recognize 
the influence it exerts on the follower. Let’s examine this in the context of 
the linear demand curve described above. There we saw that the reaction 
function was given by 


/ 2 G/ 1 ) =V 2 = a ^ ■ (27.1) 

Since we’ve assumed that marginal costs are zero, the leader’s profits are 
7Ti(yi, 2 / 2 ) = p{yi + 2 / 2)221 = ay I - by\ - byiy 2 . (27.2) 

But the output of the follower, y 2l will depend on the leader’s choice via 
the reaction function y 2 = f 2 (yi). 

Substituting from equation (27.1) into equation (27.2) we have 

7 Ti(yi,y2) = ayi - by\ - by 1 f 2 {yi) 

2 u a ~by 1 
= ay 1 - by ± - by 1 ———. 


Simplifying this expression gives us 

m(yi,y 2 ) = ^ 2/1 - ^yl- 


The marginal revenue for this function is 

MR = ^ - byi- 


Setting this equal to marginal cost, which is zero in this example, and 
solving for yi gives us 


In order to find the follower’s output, we simply substitute y\ into the 
reaction function, 


2/2 = 


a ~ byl 
2 b 


a 

46' 


These two equations give a total industry output of 2 /* + y 2 = 3a /4b. 

The Stackelberg solution can also be illustrated graphically using the 
isoprofit curves depicted in Figure 27.2. (This figure also illustrates the 
Cournot equilibrium which will be described in section 27.5.) Here we 
have illustrated the reaction curves for both firms and the isoprofit curves 
for firm 1. The isoprofit curves for firm 1 have the same general shape as 
the isoprofit curves for firm 2; they are simply rotated 90 degrees. Higher 
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Figure 

27.2 



Stackelberg equilibrium. Firm 1, the leader, chooses the 
point on firm 2’s reaction curve that touches firm l’s lowest 
possible isoprofit line, thus yielding the highest possible profits 
for firm 1. 


profits for firm 1 are associated with isoprofit curves that are lower down 
since firm l’s profits will increase as firm 2’s output decreases. 

Firm 2 is behaving as a follower, which means that it will choose an 
output along its reaction curve, / 2 (yi). Thus firm 1 wants to choose an 
output combination on the reaction curve that gives it the highest possible 
profits. But the highest possible profits means picking that point on the 
reaction curve that touches the lowest isoprofit line, as illustrated in Figure 
27.2. It follows by the usual logic of maximization that the reaction curve 
must be tangent to the isoprofit curve at this point. 


27.3 Price Leadership 


Instead of setting quantity, the leader may instead set price. In order to 
make a sensible decision about how to set its price, the leader must forecast 
how the follower will behave. Accordingly, we must first investigate the 
profit-maximization problem facing the follower. 

The first thing we observe is that in equilibrium the follower must always 
set the same price as the leader. This follows from our assumption that the 
two firms are selling identical products. If one charged a different price from 
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the other, all of the consumers would prefer the producer with the lower 
price, and we couldn’t have an equilibrium with both firms producing. 

Suppose that the leader has set a price p. We will suppose that the 
follower takes this price as given and chooses its profit-maximizing output. 
This is essentially the same as the competitive behavior we investigated 
earlier. In the competitive model, each firm takes the price as being outside 
of its control because it is such a small part of the market; in the price- 
leadership model, the follower takes the price as being outside of its control 
since it has already been set by the leader. 

The follower wants to maximize profits: 


max py 2 - 02 ( 3 / 2 )- 
2/2 

This leads to the familiar condition that the follower will want to choose 
an output level where price equals marginal cost. This determines a supply 
curve for the follower, S(p), which we have illustrated in Figure 27.3. 



Price leader. The demand curve facing the leader is the 
market demand curve minus the follower’s supply curve. The 
leader equates marginal revenue and marginal cost to find the 
optimal quantity to supply, y* L . The total amount supplied to 
the market is y^ and the equilibrium price is p*. 


Figure 

27.3 


Turn now to the problem facing the leader. It realizes that if it sets 
a price p, the follower will supply S(p). That means that the amount of 
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output the leader will sell will be R(p) = D(p) — S(p). This is called the 
residual demand curve facing the leader. 

Suppose that the leader has a constant marginal cost of production c. 
Then the profits that it achieves for any price p are given by: 

ttiO) = (P - c)[D{p) - S(p)\ = {p- c)R(p). 

In order to maximize profits the leader wants to choose a price and output 
combination where marginal revenue equals marginal cost. However, the 
marginal revenue should be the marginal revenue for the residual demand 
curve—the curve that actually measures how much output it will be able to 
sell at each given price. In Figure 27.3 the residual demand curve is linear; 
therefore the marginal revenue curve associated with it will have the same 
vertical intercept and be twice as steep. 

Let’s look at a simple algebraic example. Suppose that the inverse de¬ 
mand curve is D(jp ) = a—bp. The follower has a cost function 02(2/2) = 3/2 /2, 
and the leader has a cost function ci(y±) = cy±. 

For any price p the follower wants to operate where price equals marginal 
cost. If the cost function is 02(2/2) = 2/2/2, it can be shown that the marginal 
cost curve is M62(2/2) = 2/2- Setting price equal to marginal cost gives us 


P = 2/2- 

Solving for the follower’s supply curve gives 2/2 = S(p) = p. 

The demand curve facing the leader- -the residual demand curve—is 

R{p) = D(p) — S(p) = a — bp — p = a — (6 + 1 )p. 


From now on this is just like an ordinary monopoly problem. Solving for 
p as a function of the leader’s output 2/1, we have 


P = 


a 

b + 1 


b + 1 


2/i- 


(27.3) 


This is the inverse demand function facing the leader. The associated 
marginal revenue curve has the same intercept and is twice as steep. This 
means that it is given by 


MRi 


a 2 

6 + 1 ~~ b+l Vl ' 


Setting marginal revenue equal to marginal cost gives us the equation 

a 2 


MRi = 


-2/1 = c = MC 1. 


6+1 6 + 1 ' 

Solving for the leader’s profit-maximizing output, we have 

* _ a — c(6 + 1) 

2/l 


We could go on and substitute this into equation (27.3) to get the equilib¬ 
rium price, but the equation is not particularly interesting. 
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27.4 Comparing Price Leadership and Quantity Leadership 

We’ve seen how to calculate the equilibrium price and output in the case of 
quantity leadership and price leadership. Each model determines a different 
equilibrium price and output combination; each model is appropriate in 
different circumstances. 

One way to think about quantity setting is to think of the firm as making 
a capacity choice. When a firm sets a quantity it is in effect determining 
how much it is able to supply to the market. If one firm is able to make 
an investment in capacity first, then it is naturally modeled as a quantity 
leader. 

On the other hand, suppose that we look at a market where capacity 
choices are not important but one of the firms distributes a catalog of 
prices. It is natural to think of this firm as a price setter. It’s rivals may 
then take the catalog price as given and make their own pricing and supply 
decision accordingly. 

Whether the price-leadership or the quantity-leadership model is appro¬ 
priate is not a question that can be answered on the basis of pure theory. 
We have to look at how the firms actually make their decisions in order to 
choose the most appropriate model. 


27.5 Simultaneous Quantity Setting 

One difficulty with the leader-follower model is that it is necessarily asym¬ 
metric: one firm is able to make its decision before the other firm. In some 
situations this is unreasonable. For example, suppose that two firms are 
simultaneously trying to decide what quantity to produce. Here each firm 
has to forecast what the other firm’s output will be in order to make a 
sensible decision itself. 

In this section we will examine a one-period model in which each firm 
has to forecast the other firm’s output choice. Given its forecast, each firm 
then chooses a profit-maximizing output for itself. We then seek an equi¬ 
librium in forecasts—a situation where each firm finds its beliefs about the 
other firm to be confirmed. This model is known as the Cournot model, 
after the nineteenth-century French mathematician who first examined its 
implications. 3 

We begin by assuming that firm 1 expects that firm 2 will produce 
units of output. (The e stands for expected output.) If firm 1 decides to 
produce y\ units of output, it expects that the total output produced will 


3 Augustin Cournot (pronounced “core-no”) was born in 1801. His book, Researches 
into the Mathematical Principles of the Theory of Wealth, was published in 1838. 
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be Y — j/i + yf, and output will yield a market price of p(Y) = p(yi + yf). 
The profit-maximization problem of firm 1 is then 

maxp(yx + yf)yi - c(yi). 
y i 

For any given belief about the output of firm 2, yf, there will be some 
optimal choice of output for firm 1, z/i. Let us write this functional rela¬ 
tionship between the expected output of firm 2 and the optimal choice of 
firm 1 as 

Vi = h{yl)- 

This function is simply the reaction function that we investigated earlier 
in this chapter. In our original treatment the reaction function gave the 
follower’s output as a function of the leader’s choice. Here the reaction 
function gives one firm’s optimal choice as a function of its beliefs about 
the other firm’s choice. Although the interpretation of the reaction function 
is different in the two cases, the mathematical definition is exactly the same. 

Similarly, we can derive firm 2’s reaction curve: 

V2 = fziyt), 

which gives firm 2’s optimal choice of output for a given expectation about 
firm l’s output, yf. 

Now, recall that each firm is choosing its output level assuming that the 
other firm’s output will be at yf or yf. For arbitrary values of yf and yf 
this won’t happen—in general firm l’s optimal level of output, yi, will be 
different from what firm 2 expects the output to be, yf. 

Let us seek an output combination (yf, yf) such that the optimal output 
level for firm 1, assuming firm 2 produces yf, is yf and the optimal output 
level for firm 2, assuming that firm 1 stays at yf, is yf. In other words, the 
output choices (yf,yf) satisfy 


yl = /i(z/ 2 *) 


2 / 2 * = hiyl). 

Such a combination of output levels is known as a Cournot equilib¬ 
rium. In a Cournot equilibrium, each firm is maximizing its profits, given 
its beliefs about the other firm’s output choice, and, furthermore, those 
beliefs are confirmed in equilibrium: each firm optimally chooses to pro¬ 
duce the amount of output that the other firm expects it to produce. In a 
Cournot equilibrium neither firm will find it profitable to change its output 
once it discovers the choice actually made by the other firm. 

An example of a Cournot equilibrium is given in Figure 27.2. The 
Cournot equilibrium is simply the pair of outputs at which the two reaction 
curves cross. At such a point, each firm is producing a profit-maximizing 
level of output given the output choice of the other firm. 
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27.6 An Example of Cournot Equilibrium 


Recall the case of the linear demand function and zero marginal costs that 
we investigated earlier. We saw that in this case the reaction function for 
firm 2 took the form 

a-byl 


Since in this example firm 1 is exactly the same as firm 2, its reaction 
curve has the same form: 


V i 


a ~ by% 
2b 


Figure 27.4 depicts this pair of reaction curves. The intersection of the 
two lines gives us the Cournot equilibrium. At this point each firm’s choice 
is the profit-maximizing choice, given its beliefs about the other firm’s be¬ 
havior, and each firm’s beliefs about the other firm’s behavior are confirmed 
by its actual behavior. 



Cournot equilibrium. Each firm is maximizing its profits, 
given its beliefs about the other firm’s output decision. The 
Cournot equilibrium is at (y \, y %), where the two reaction curves 
cross. 


Figure 

27.4 
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In order to calculate the Cournot equilibrium algebraically, we look for 
the point (jq, j/ 2 ) where each firm is doing what the other firm expects it to 
do. We set y\ = y\ and y 2 = y 2 , which gives us the following two equations 
in two unknowns: 

a - by 2 

a - by i 
V2 ~ 2b 

In this example, both firms are identical, so each will produce the same 
level of output in equilibrium. Hence we can substitute y\ = y 2 into one of 
the above equations to get 


V i = 


a - by! 
2b ' 


Solving for y\, we get 


Vi = 


3 b' 


Since the two firms are identical, this implies that 


V2 = 


a 
3 b 


as well, and the total industry output is 


V*i +2/2 


2 a 
36' 


27.7 Adjustment to Equilibrium 

We can use Figure 27.4 to describe a process of adjustment to equilibrium. 
Suppose that at time t the firms are producing outputs (y\,y 2 ), which are 
not necessarily equilibrium outputs. If firm 1 expects that firm 2 is going 
to continue to keep its output at y then next period firm 1 would want to 
choose the profit-maximizing output given that expectation, namely fi(y 2 )- 
Thus firm l’s choice in period t + 1 will be given by 

y{ +1 = hivl)- 

Firm 2 can reason the same way, so firm 2’s choice next period will be 

y\ +1 = h{y\). 

These equations describe how each firm adjusts its output in the face 
of the other firm’s choice. Figure 27.4 illustrates the movement of the 





MANY FIRMS IN COURNOT EQUILIBRIUM 511 


outputs of the firms implied by this behavior. Here is the way to interpret 
the diagram. Start with some operating point Given firm 2’s level 

of output, firm 1 optimally chooses to produce y{ +1 = /i( 2 / 2 ) nex t period. 
We find this point in the diagram by moving horizontally to the left until 
we hit firm l’s reaction curve. 

If firm 2 expects firm 1 to continue to produce y\ + , its optimal response 
is to produce y^ +1 ■ We find this point by moving vertically upward until we 
hit firm 2’s reaction function. We continue to move along the “staircase” to 
determine the sequence of output choices of the two firms. In the example 
illustrated, this adjustment process converges to the Cournot equilibrium. 
We say that in this case the Cournot equilibrium is a stable equilibrium. 

Despite the intuitive appeal of this adjustment process, it does present 
some difficulties. Each firm is assuming that the other’s output will be 
fixed from one period to the next, but as it turns out, both firms keep 
changing their output. Only in equilibrium is one firm’s expectation about 
the other firm’s output choice actually satisfied. For this reason, we will 
generally ignore the question of how the equilibrium is reached and focus 
only on the issue of how the firms behave in the equilibrium. 


27.8 Many Firms in Cournot Equilibrium 


Suppose now that we have several firms involved in a Cournot equilibrium, 
not just two. In this case we suppose that each firm has an expectation 
about the output choices of the other firms in the industry and seek to 
describe the equilibrium output. 

Suppose that there are n firms and let Y = y\ + ■ ■ ■ + y n be the total 
industry output. Then the “marginal revenue equals marginal cost condi¬ 
tion” for firm i is 

p( y ) + ^7 Vi = MC {Vi)- 

If we factor out PlY) and multiply the second term by Y/Y, we can write 
this equation as 


p(Y) 


1 1 Ap Y Vi 

A Yp(Y) Y 


MC(y z ). 


Using the definition of elasticity of the aggregate demand curve and letting 
Si = Vi/Y be firm *’s share of total market output, this reduces to 


P(X) 


1 - 


H Y )U 

We can also write this expression as 

1 


= MC( V i). 


p(Y) 


1 - 


<Y)\/ 8i \ 


= MC( V i). 


(27.4) 
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This looks just like the expression for the monopolist except for the .s, term. 
We can think of e(F)/sj as being the elasticity of the demand curve facing 
the firm: the smaller the market share of the firm, the more elastic the 
demand curve it faces. 

If its market share is 1—the firm is a monopolist—the demand curve 
facing the firm is the market demand curve, so the condition just reduces 
to that of the monopolist. If the firm is a very small part of a large market, 
its market share is effectively zero, and the demand curve facing the firm is 
effectively flat. Thus the condition reduces to that of the pure competitor: 
price equals marginal cost. 

This is one justification for the competitive model described in Chap¬ 
ter 22. If there are a large number of firms, then each firm’s influence on 
the market price is negligible, and the Cournot equilibrium is effectively 
the same as pure competition. 


27.9 Simultaneous Price Setting 

In the Cournot model described above we have assumed that firms were 
choosing their quantities and letting the market determine the price. An¬ 
other approach is to think of firms as setting their prices and letting the 
market determine the quantity sold. This model is known as Bertrand 
competition . 4 

When a firm chooses its price, it has to forecast the price set by the other 
firm in the industry. Just as in the case of Cournot equilibrium we want 
to find a pair of prices such that each price is a profit-maximizing choice 
given the choice made by the other firm. 

What does a Bertrand equilibrium look like? When firms are selling 
identical products, as we have been assuming, the Bertrand equilibrium 
has a very simple structure indeed. It turns out to be the competitive 
equilibrium, where price equals marginal cost! 

First we note that price can never be less than marginal cost since then 
either firm would increase its profits by producing less. So let us consider 
the case where price is greater than marginal cost. Suppose that both firms 
are selling output at some price p greater than marginal cost. Consider the 
position of firm 1. If it lowers its price by any small amount e and if the 
other firm keeps its price fixed at p , all of the consumers will prefer to 
purchase from firm 1. By cutting its price by an arbitrarily small amount, 
it can steal all of the customers from firm 2. 

If firm 1 really believes that firm 2 will charge a price p that is greater 
than marginal cost, it will always pay firm 1 to cut its price to p — e. But 
firm 2 can reason the same way! Thus any price higher than marginal 

4 Joseph Bertrand, also a French mathematician, presented his model in a review of 

Cournot’s work. 
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cost cannot be an equilibrium; the only equilibrium is the competitive 
equilibrium. 

This result seems paradoxical when you first encounter it: how can we 
get a competitive price if there are only two firms in the market? If we 
think of the Bertrand model as a model of competitive bidding it makes 
more sense. Suppose that one firm “bids” for the consumers’ business by 
quoting a price above marginal cost. Then the other firm can always make 
a profit by undercutting this price with a lower price. It follows that the 
only price that each firm cannot rationally expect to be undercut is a price 
equal to marginal cost. 

It is often observed that competitive bidding among firms that are unable 
to collude can result in prices that are much lower than can be achieved 
by other means. This phenomenon is simply an example of the logic of 
Bertrand competition. 


27.10 Collusion 

In the models we have examined up until now the firms have operated 
independently. But if the firms collude so as to jointly determine their 
output, these models are not very reasonable. If collusion is possible, the 
firms would do better to choose the output that maximizes total industry 
profits and then divide up the profits among themselves. When firms get 
together and attempt to set prices and outputs so as to maximize total 
industry profits, they are known as a cartel. As we saw in Chapter 24, a 
cartel is simply a group of firms that jointly collude to behave like a single 
monopolist and maximize the sum of their profits. 

Thus the profit-maximization problem facing the two firms is to choose 
their outputs y\ and y 2 so as to maximize total industry profits: 

max p(y 1 + y 2 )[yi + 2 / 2 ] - Ci(yi) - c 2 (y 2 ). 
y 1 , 2/2 


This will have the optimality conditions 

p(yt + y*2) + ^[y*i + y*2) = MC 1 ( y * 1 ) 

p(y* + y*2 ) + Ivl + vl) = mc 2 ( v * 2 ). 

The interpretation of these conditions is interesting. When firm 1 consid¬ 
ers expanding its output by Ay%, it will contemplate the usual two effects: 
the extra profits from selling more output and the reduction in profits from 
forcing the price down. But in the second effect, it now takes into account 
the effect of the lower price on both its own output and the output of the 
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other firm. This is because it is now interested in maximizing total industry 
profits, not just its own profits. 

The optimality conditions imply that the marginal revenue of an extra 
unit of output must be the same no matter where it is produced. It follows 
that MCi(yl) = M 02 ( 3 ) 2 ), so that the two marginal costs will be equal in 
equilibrium. If one firm has a cost advantage, so that its marginal cost curve 
always lies below that of the other firm, then it will necessarily produce 
more output in equilibrium in the cartel solution. 

The problem with agreeing to join a cartel in real life is that there is 
always a temptation to cheat. Suppose, for example, that the two firms 
are operating at the outputs that maximize industry profits (j/i, 2 / 2 ) an< -l 
firm 1 considers producing a little more output, Ayi. The marginal profits 
accruing to firm 1 will be 


^ = PiVi + Vi) + ^pf/J - MC,{yl). 


(27.5) 


We saw earlier that the optimality condition for the cartel solution is 

PiVi + Va) + ^Vl + ^ - MC^yl) = 0. 

Rearranging this equation gives us 

P(V*1 + i/ 5 ) + ^rVl - MCM = ~^y* 2 > 0. (27.6) 


The last inequality follows since Ap/AY is negative, since the market de¬ 
mand curve has a negative slope. 

Inspecting equations (27.5) and (27.6) we see that 


A-7Tl 

Ayr 


> 0 . 


Thus, if firm 1 believes that firm 2 will keep its output fixed, then it will 
believe that it can increase profits by increasing its own production. In the 
cartel solution, the firms act together to restrict output so as not to “spoil” 
the market. They recognize the effect on joint profits from producing more 
output in either firm. But if each firm believes that the other firm will 
stick to its output quota, then each firm will be tempted to increase its 
own profits by unilaterally expanding its output. At the output levels 
that maximize joint profits, it will always be profitable for each firm to 
unilaterally increase its output—if each firm expects that the other firm 
will keep its output fixed. 

The situation is even worse than that. If firm 1 believes that firm 2 
will keep its output fixed, then it will find it profitable to increase its 
own output. But if it thinks that firm 2 will increase its output, then 
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firm 1 would want increase its output first and make its profits while it 
can! 

Thus, in order to maintain an effective cartel, the firms need a way to 
detect and punish cheating. If they have no way to observe each other’s 
output, the temptation to cheat may break the cartel. We’ll return to this 
point a little later. 

To make sure that we understand the cartel solution, let’s calculate it 
for the case of zero marginal costs and the linear demand curve we used in 
the Cournot case. 

The aggregate profit function will be 

7 r (yi ) 2/2) = [a - b(yi + y 2 )\(yi + 2/2) = a(yi + y 2 ) - %i + y 2 ) 2 , 

so the marginal revenue equals marginal cost conditions will be 

a-2b(y* 1 +y* 2 ) = 0, 

which implies that 

1 ^ 

yi + y* = 21 - 

Since marginal costs are zero, the division of output between the two firms 
doesn’t matter. All that is determined is the total level of industry output. 

This solution is shown in Figure 27.5. Here we have illustrated the 
isoprofit curves for each of the firms and have highlighted the locus of 
common tangents. Why is this line of interest? Since the cartel is trying 
to maximize total industry profits, it follows that the marginal profits from 
having either firm produce more output must be the same—otherwise it 
would pay to have the more profitable firm produce more output. This in 
turn implies that the slopes of the isoprofit curves must be the same for 
each firm; that is, that the isoprofit curves must be tangent to each other. 
Hence the output combinations that maximize total industry profits—the 
cartel solution—are those that lie along the line illustrated in Figure 27.5. 

Figure 27.5 also illustrates the temptation to cheat that is present at the 
cartel solution. Consider, for example, the point where the two firms split 
the market equally. Think about what would happen if firm 1 believed that 
firm 2 would keep its output constant. If firm 1 increased its output and 
firm 2 kept constant output, then firm 1 would move to a lower isoprofit 
curve—which means that firm 1 would increase its profits. This is exactly 
the story told in the algebra above. If one firm thinks that the other’s 
output will remain constant, it will be tempted to increase its own output 
and thereby make higher profits. 


27.11 Punishment Strategies 

We have seen that a cartel is fundamentally unstable in the sense that it 
is always in the interest of each of the firms to increase their production 
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Figure 

27.5 



A cartel. If industry profits are maximized, then the marginal 
profit from producing more output in either firm must be the 
same. This implies that the isoprofit curves must be tangent to 
one another at the profit-maximizing levels of output. 


above that which maximizes aggregate profit. If the cartel is to operate 
successfully, some way must be found to “stabilize” the behavior. One way 
to do this is for firms to threaten to punish each other for cheating on the 
cartel agreement. In this section, we investigate the size of punishments 
necessary to stabilize a cartel. 

Consider a duopoly composed of two identical firms. If each firm pro¬ 
duces half the monopoly amount of output, total profits will be maximized 
and each firm will get a payoff of, say, 7r m . In an effort to make this outcome 
stable, one firm announces to the other: “If you stay at the production level 
that maximizes joint industry projects, fine. But if I discover you cheat¬ 
ing by producing more than this amount, I will punish you by producing 
the Cournot level of output forever.” This is known as a punishment 
strategy. 

When will this sort of threat be adequate to stabilize the cartel? We 
have to look at the benefits and costs of cheating as compared to those of 
cooperating. Suppose that cheating occurs, and the punishment is carried 
out. Since the optimal response to Cournot behavior is Cournot behavior 
(by definition), this results in each firm receiving a per-period profit of, say, 
7 r c . Of course, the Cournot payoff, tt c is less than the cartel payoff, n m . 

Let us suppose that the two firms are each producing at the collusive, 






PUNISHMENT STRATEGIES 517 


monopoly level of production. Put yourself in the place of one of the 
firms trying to decide whether to continue to produce at your quota. If 
you produce more output, deviating from your quota, you make profit 7 r^, 
where ird > 7r m . This is the standard temptation facing a cartel member 
described above: if each firm restricts output and pushes the price up, then 
each firm has an incentive to capitalize on the high price by increasing its 
production. 

But this isn’t the end of the story because of the punishment for cheating. 
By producing at the cartel amount, each firm gets a steady stream of 
payments of n m . The present value of this stream starting today is given 
by ^ 

Present value of cartel behavior = 7r m H——. 

r 

If the firm produces more than the cartel amount, it gets a one-time benefit 
of profits 7 Td, but then has to live with the breakup of the cartel and the 
reversion to Cournot behavior: 

7 T C 

Present value of cheating = 7r<H-. 

r 


When will the present value of remaining at the cartel output 
than the present value of cheating on the cartel agreement? 
when 


^ . 7T c 

- > 7T d H-, 


r 


r 


be greater 
Obviously 


which can also be written as 


1 \ 

TTra 

Note that the numerator of this fraction is positive, since the monopoly 
profits are larger than the Cournot profits, and the denominator is positive, 
since deviation is even more profitable than sticking with the monopoly 
quota. 

The inequality says that as long as the interest rate is sufficiently small, 
so that the prospect of future punishment is sufficiently important, it will 
pay the firms to stick to their quotas. 

The weakness of this model is that the threat to revert to Cournot be¬ 
havior forever is not very believable. One firm certainly may believe that 
the other will punish it for deviating, but “forever” is a long time. A more 
realistic model would consider shorter periods of retaliation, but the analy¬ 
sis then becomes much more complex. In the next chapter, we discuss some 
models of “repeated games” that illustrate some of the possible behaviors. 


EXAMPLE: Price Matching and Competition 

We have seen that there is always a temptation for each member of a 
cartel to produce more than its quota. In order to maintain a successful 
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cartel, some way must be found to police members’ behavior by some form 
of punishment for deviations from the joint profit-maximizing output. In 
particular this means that firms must be able to keep track of the prices 
and production levels of the other firms in the cartel. 

One easy way to acquire information about what the other firms in your 
industry are charging is to use your customers to spy on the other firms. 
It is common to see retail firms announce that they will “beat any price.” 
In some cases, such an offer may indicate a highly competitive retail en¬ 
vironment. But in other cases, this same policy can be used to gather 
information about other firms’ prices in order to maintain a cartel. 

Suppose, for example, that two firms agree, either explicitly or implicitly 
to sell a certain model of refrigerator for $700. How can either of the stores 
be sure that the other firm isn’t cheating on their agreement and selling 
the refrigerator for $675? One way is to offer to beat any price a customer 
can find. That way, the customers report any attempts to cheat on the 
collusive arrangement. 


EXAMPLE: Voluntary Export Restraints 

During the 1980s, the Japanese automobile companies agreed to a “volun¬ 
tary export restraint (VER).” This meant that they would “voluntarily” 
reduce the exports of their automobiles to the United States. The typ¬ 
ical U.S. consumer thought that this was a great victory for U.S. trade 
negotiators. 

But if you think about this for a minute, things look quite different. In 
our examination of oligopoly we have seen that the problem facing firms in 
an industry is how to restrict output in order to support higher prices and 
discourage competition. As we’ve seen, there will always be a temptation 
to cheat on production agreements; every cartel must find a way to detect 
and prevent this cheating. It is especially convenient for the firms if a third 
party, such as the government, can serve this role. This is exactly the role 
that the U.S. government played for the Japanese auto makers! 

According to one estimate Japanese imported cars were about $2500 
more expensive in 1984 than they would have been without the VERs. 
Furthermore, the higher prices of imported cars allowed American produc¬ 
ers to sell their automobiles at about $1000 more than they would have 
otherwise. 5 

Due to these higher prices the U.S. consumers paid about $10 billion 
more for Japanese cars in 1985-86 than they would have otherwise. This 
money has gone directly into the pockets of the Japanese automobile pro¬ 
ducers. Much of this additional profit appears to have been invested in 

5 Robert Crandall, “Import Quotas and the Automobile Industry: the Costs of Pro¬ 
tectionism,” The Brookings Review, Summer, 1984. 
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increasing productive capabilities, which allowed the Japanese auto pro¬ 
ducers to reduce the cost of producing new cars in subsequent years. The 
VERs did succeed in saving American jobs; however, it appears that the 
cost per job saved was about $160,000 per year. 

If the goal of the VER policy was simply to increase the health of the 
American automobile industry, there was a much simpler way to do this: 
just impose a $2500 tariff on each imported Japanese car. This way the 
revenues due to the restriction of trade would accrue to the U.S. government 
rather than to the Japanese automobile industry. Rather than send $10 
billion abroad during 1985-86, the U.S. government could have spent the 
money on projects designed to increase the long-term health of the U.S. 
auto industry. 


27.12 Comparison of the Solutions 


We have now examined several models of duopoly behavior: quantity 
leadership (Stackelberg), price leadership, simultaneous quantity setting 
(Cournot), simultaneous price setting (Bertrand), and the collusive solu¬ 
tion. How do they compare? 

In general, collusion results in the smallest industry output and the high¬ 
est price. Bertrand equilibrium—the competitive equilibrium—gives us the 
highest output and the lowest price. The other models give results that are 
in between these two extremes. 

A variety of other models are possible. For example, we could look at a 
model with differentiated products where the two goods produced were not 
perfect substitutes for each other. Or we could look at a model where the 
firms make a sequence of choices over time. In this framework, the choices 
that one firm makes at one time can influence the choices that the other 
firm makes later on. 

We have also assumed that each firm knows the demand function and the 
cost functions of the other firms in the industry. In reality these functions 
are never known for sure. Each firm needs to estimate the demand and 
cost conditions facing its rivals when it makes its own decisions. All of 
these phenomena have been modeled by economists, but the models become 
much more complex. 


Summary 

1. An oligopoly is characterized by a market with a few firms that rec¬ 
ognize their strategic interdependence. There are several possible ways 
for oligopolies to behave depending on the exact nature of their interac¬ 
tion. 
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2. In the quantity-leader (Stackelberg) model one firm leads by setting its 
output, and the other firm follows. When the leader chooses an output, it 
will take into account how the follower will respond. 

3. In the price-leader model, one firm sets its price, and the other firm 
chooses how much it wants to supply at that price. Again the leader has 
to take into account the behavior of the follower when it makes its deci¬ 
sion. 

4. In the Cournot model, each firm chooses its output so as to maximize 
its profits given its beliefs about the other firm’s choice. In equilibrium 
each firm finds that its expectation about the other firm’s choice is con¬ 
firmed. 

5. A Cournot equilibrium in which each firm has a small market share 
implies that price will be very close to marginal cost—that is, the industry 
will be nearly competitive. 

6 . In the Bertrand model each firm chooses its price given its beliefs about 
the price that the other firm will choose. The only equilibrium price is the 
competitive equilibrium. 

7. A cartel consists of a number of firms colluding to restrict output and to 
maximize industry profit. A cartel will typically be unstable in the sense 
that each firm will be tempted to sell more than its agreed upon output if 
it believes that the other firms will not respond. 


REVIEW QUESTIONS 


1. Suppose that we have two firms that face a linear demand curve p(Y) = 
a — bY and have constant marginal costs, c, for each firm. Solve for the 
Cournot equilibrium output. 

2. Consider a cartel in which each firm has identical and constant marginal 
costs. If the cartel maximizes total industry profits, what does this imply 
about the division of output between the firms? 

3. Can the leader ever get a lower profit in a Stackelberg equilibrium than 
he would get in the Cournot equilibrium? 

4. Suppose there are n identical firms in a Cournot equilibrium. Show that 
the absolute value of the elasticity of the market demand curve must be 
greater than 1/n. (Hint: in the case of a monopolist, n = 1, and this 
simply says that a monopolist operates at an elastic part of the demand 
curve. Apply the logic that we used to establish that fact to this problem.) 
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5. Draw a set of reaction curves that result in an unstable equilibrium. 

6 . Do oligopolies produce an efficient level of output? 


CHAPTER 



GAME 

THEORY 


The previous chapter on oligopoly theory presented the classical economic 
theory of strategic interaction among firms. But that is really just the tip 
of the iceberg. Economic agents can interact strategically in a variety of 
ways, and many of these have been studied by using the apparatus of game 
theory. Game theory is concerned with the general analysis of strategic 
interaction. It can be used to study parlor games, political negotiation, and 
economic behavior. In this chapter we will briefly explore this fascinating 
subject to give you a flavor of how it works and how it can be used to study 
economic behavior in oligopolistic markets. 


28.1 The Payoff Matrix of a Game 

Strategic interaction can involve many players and many strategies, but 
we’ll limit ourselves to two-person games with a finite number of strategies. 
This will allow us to depict the game easily in a payoff matrix. It is 
simplest to examine this in the context of a specific example. 

Suppose that two people are playing a simple game. Person A will write 
one of two words on a piece of paper, “top” or “bottom.” Simultaneously, 
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person B will independently write “left” or “right” on a piece of paper. 
After they do this, the papers will be examined and they will each get 
the payoff depicted in Table 28.1. If A says top and B says left, then we 
examine the top left-hand corner of the matrix. In this matrix the payoff 
to A is the first entry in the box, 1, and the payoff to B is the second entry, 
2. Similarly, if A says bottom and B says right, then A will get a payoff of 
1 and B will get a payoff of 0. 

Person A has two strategies: he can choose top or he can choose bot¬ 
tom. These strategies could represent economic choices like “raise price” or 
“lower price.” Or they could represent political choices like “declare war” 
or “don’t declare war.” The payoff matrix of a game simply depicts the 
payoffs to each player for each combination of strategies that are chosen. 

What will be the outcome of this sort of game? The game depicted in 
Table 28.1 has a very simple solution. From the viewpoint of person A, it 
is always better for him to say bottom since his payoffs from that choice 
(2 or 1) are always greater than their corresponding entries in top (1 or 
0). Similarly, it is always better for B to say left since 2 and 1 dominate 
1 and 0. Thus we would expect that the equilibrium strategy is for A to 
play bottom and B to play left. 

In this case, we have a dominant strategy. There is one optimal choice 
of strategy for each player no matter what the other player does. Whichever 
choice B makes, player A will get a higher payoff if he plays bottom, so 
it makes sense for A to play bottom. And whichever choice A makes, B 
will get a higher payoff if he plays left. Hence, these choices dominate the 
alternatives, and we have an equilibrium in dominant strategies. 


A payoff matrix of a game. 



Player B 

left 

Right 

Player A 

Top 

1,2 

0, 1 

Bottom 

2, 1 

1 , 0 


Table 

28.1 


If there is a dominant strategy for each player in some game, then we 
would predict that it would be the equilibrium outcome of the game. For 
a dominant strategy is a strategy that is best no matter what the other 
player does. In this example, we would expect an equilibrium outcome in 
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which A plays bottom, receiving an equilibrium payoff of 2, and B plays 
left, receiving an equilibrium payoff of 1. 


28.2 Nash Equilibrium 

Dominant strategy equilibria are nice when they happen, but they don’t 
happen all that often. For example, the game depicted in Table 28.2 doesn’t 
have a dominant strategy equilibrium. Here when B chooses left the payoffs 
to A are 2 or 0. When B chooses right, the payoffs to A are 0 or 1. This 
means that when B chooses left, A would want to choose top; and when B 
chooses right, A would want to choose bottom. Thus A’s optimal choice 
depends on what he thinks B will do. 


Table 

28.2 


A Nash equilibrium. 



Player B 

left 

Right 

Player A 

Top 

2, 1 

0 , 0 

Bottom 

0 , 0 

1,2 


However, perhaps the dominant strategy equilibrium is too demanding. 
Rather than require that A’s choice be optimal for all choices of B, we 
can just require that it be optimal for the optimal choices of B. For if B 
is a well-informed intelligent player, he will only want to choose optimal 
strategies. (Although, what is optimal for B will depend on A’s choice as 
well!) 

We will say that a pair of strategies is a Nash equilibrium if A’s choice 
is optimal, given B’s choice, and B’s choice is optimal given A’s choice. * 1 
Remember that neither person knows what the other person will do when 
he has to make his own choice of strategy. But each person may have 

1 John Nash is an American mathematician who formulated this fundamental concept 
of game theory in 1951. In 1994 he received the Nobel Prize in economics, along with 
two other game theory pioneers, John Harsanyi and Reinhard Selten. The 2002 film 
A Beautiful Mind is loosely based on John Nash’s life; it won the Academy Award 
for best movie. 
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some expectation about what the other person’s choice will be. A Nash 
equilibrium can be interpreted as a pair of expectations about each per¬ 
son’s choice such that, when the other person’s choice is revealed, neither 
individual wants to change his behavior. 

In the case of Table 28.2, the strategy (top, left) is a Nash equilibrium. 
To prove this note that if A chooses top, then the best thing for B to do 
is to choose left, since the payoff to B from choosing left is 1 and from 
choosing right is 0. And if B chooses left, then the best thing for A to do 
is to choose top since then A will get a payoff of 2 rather than of 0. 

Thus if A chooses top, the optimal choice for B is to choose left; and if 
B chooses left, then the optimal choice for A is top. So we have a Nash 
equilibrium: each person is making the optimal choice, given the other 
person’s choice. 

The Nash equilibrium is a generalization of the Cournot equilibrium 
described in the last chapter. There the choices were output levels, and 
each firm chose its output level taking the other firm’s choice as being 
fixed. Each firm was supposed to do the best for itself, assuming that the 
other firm continued to produce the output level it had chosen—that is, it 
continued to play the strategy it had chosen. A Cournot equilibrium occurs 
when each firm is maximizing profits given the other firm’s behavior; this 
is precisely the definition of a Nash equilibrium. 

The Nash equilibrium notion has a certain logic. Unfortunately, it also 
has some problems. First, a game may have more than one Nash equilib¬ 
rium. In fact, in Table 28.2 the choices (bottom, right) also comprise a 
Nash equilibrium. You can either verify this by the kind of argument used 
above, or just note that the structure of the game is symmetric: B’s payoffs 
are the same in one outcome as A’s payoffs are in the other, so that our 
proof that (top, left) is an equilibrium is also a proof that (bottom, right) 
is an equilibrium. 

The second problem with the concept of a Nash equilibrium is that there 
are games that have no Nash equilibrium of the sort we have been describing 
at all. Consider, for example, the case depicted in Table 28.3. Here a Nash 
equilibrium of the sort we have been examining does not exist. If player A 
plays top, then player B wants to play left. But if player B plays left, then 
player A wants bottom. Similarly, if player A plays bottom, then player B 
will play right. But if player B plays right, then player A will play top. 


28.3 Mixed Strategies 


However, if we enlarge our definition of strategies, we can find a new sort 
of Nash equilibrium for this game. We have been thinking of each agent 
as choosing a strategy once and for all. That is, each agent is making one 
choice and sticking to it. This is called a pure strategy. 
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Table 

28.3 


A game with no Nash equilibrium (in pure strategies). 



Player B 

Left 

Right 

Player A 

Top 

0 , 0 

0 , -1 

Bottom 

1,0 

-1, 3 


Another way to think about it is to allow the agents to randomize their 
strategies—to assign a probability to each choice and to play their choices 
according to those probabilities. For example, A might choose to play top 
50 percent of the time and bottom 50 percent of the time, while B might 
choose to play left 50 percent of the time and right 50 percent of the time. 
This kind of strategy is called a mixed strategy. 

If A and B follow the mixed strategies given above, of playing each of 
their choices half the time, then they will have a probability of 1/4 of ending 
up in each of the four cells in the payoff matrix. Thus the average payoff 
to A will be 0, and the average payoff to B will be 1/2. 

A Nash equilibrium in mixed strategies refers to an equilibrium in which 
each agent chooses the optimal frequency with which to play his strategies 
given the frequency choices of the other agent. 

It can be shown that for the sort of games we are analyzing in this chap¬ 
ter, there will always exist a Nash equilibrium in mixed strategies. Because 
a Nash equilibrium in mixed strategies always exists, and because the con¬ 
cept has a certain inherent plausibility, it is a very popular equilibrium 
notion in analyzing game behavior. In the example in Table 28.3 it can 
be shown that if player A plays top with probability 3/4 and bottom with 
probability 1/4, and player B plays left with probability 1/2 and right with 
probability 1/2, this will constitute a Nash equilibrium. 


EXAMPLE: Rock Paper Scissors 

But enough of this theory. Let’s look at an example that really matters: 
the well-known pastime of “rock paper scissors.” In this game, each player 
simultaneously chooses to display a fist (rock), a palm (paper), or his first 
two fingers (scissors). The rules: rock breaks scissors, scissors cuts paper, 
paper wraps rock. 

Throughout history, countless hours have been spent in playing this 
game. There is even a professional society, the RPS Society, that pro- 














THE PRISONER'S DILEMMA 527 


motes the game. It offer both a Web site and a movie documenting the 
2003 championships in Toronto. 

Of course, game theorists recognize that the equilibrium strategy in rock 
paper scissors is to randomly choose one of the three outcomes. But humans 
are not necessarily so good at choosing totally random outcomes. If you 
can predict your opponent’s choices to some degree, you can have an edge 
in making your own choices. 

According to the somewhat tongue-in-cheek account of Jennifer 8. Lee, 
psychology is paramount. 2 In her article she writes that “most people have 
a go-to throw, reflective of their character, when they are caught off guard. 
Paper, considered a refined, even passive, throw, is apparently favored by 
literary types and journalists.” 

What is the go-to throw of economists, I wonder? Perhaps it is scissors, 
since we like to cut to the essential forces at work in human behavior. 
Should you play rock against an economist, then? Perhaps, but I wouldn’t 
rely on it ... 


28.4 The Prisoner's Dilemma 

Another problem with the Nash equilibrium of a game is that it does 
not necessarily lead to Pareto efficient outcomes. Consider, for example, 
the game depicted in Table 28.4. This game is known as the prisoner’s 
dilemma. The original discussion of the game considered a situation where 
two prisoners who were partners in a crime were being questioned in sep¬ 
arate rooms. Each prisoner had a choice of confessing to the crime, and 
thereby implicating the other, or denying that he had participated in the 
crime. If only one prisoner confessed, then he would go free, and the au¬ 
thorities would throw the book at the other prisoner, requiring him to 
spend 6 months in prison. If both prisoners denied being involved, then 
both would be held for 1 month on a technicality, and if both prisoners 
confessed they would both be held for 3 months. The payoff matrix for 
this game is given in Table 28.4. The entries in each cell in the matrix 
represent the utility that each of the agents assigns to the various prison 
terms, which for simplicity we take to be the negative of the length of their 
prison terms. 

Put yourself in the position of player A. If player B decides to deny 
committing the crime, then you are certainly better off confessing, since 
then you’ll get off free. Similarly, if player B confesses, then you’ll be 
better off confessing, since then you get a sentence of 3 months rather than 
a sentence of 6 months. Thus whatever player B does, player A is better 
off confessing. 


2 


Yes, “8” really is her middle name. “Rock, Paper, Scissors: High Drama in the 
Tournament Ring” was published in the New York Times on September 5, 2004. 
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Table 

28.4 


The prisoner’s dilemma. 



Player B 

Confess 

Deny 

Player A 

Confess 

-3,-3 

0, -6 

Deny 

-6, 0 

- 1 , -1 


The same thing goes for player B—he is better off confessing as well. 
Thus the unique Nash equilibrium for this game is for both players to 
confess. In fact, both players confessing is not only a Nash equilibrium, it 
is a dominant strategy equilibrium, since each player has the same optimal 
choice independent of the other player. 

But if they could both just hang tight, they would each be better off! If 
they both could be sure the other would hold out, and both could agree to 
hold out themselves, they would each get a payoff of —1, which would make 
each of them better off. The strategy (deny, deny) is Pareto efficient—there 
is no other strategy choice that makes both players better off —while the 
strategy (confess, confess) is Pareto inefficient. 

The problem is that there is no way for the two prisoners to coordinate 
their actions. If each could trust the other, then they could both be made 
better off. 

The prisoner’s dilemma applies to a wide range of economic and political 
phenomena. Consider, for example, the problem of arms control. Interpret 
the strategy of “confess” as “deploy a new missile” and the strategy of 
“deny” as “don’t deploy.” Note that the payoffs are reasonable. If my 
opponent deploys his missile, I certainly want to deploy, even though the 
best strategy for both of us is to agree not to deploy. But if there is no 
way to make a binding agreement, we each end up deploying the missile 
and are both made worse off. 

Another good example is the problem of cheating in a cartel. Now in¬ 
terpret confess as “produce more than your quota of output” and interpret 
deny as “stick to the original quota.” If you think the other firm is going 
to stick to its quota, it will pay you to produce more than your own quota. 
And if you think that the other firm will overproduce, then you might as 
well, too! 

The prisoner’s dilemma has provoked a lot of controversy as to what is the 
“correct” way to play the game—or, more precisely, what is a reasonable 
way to play the game. The answer seems to depend on whether you are 
playing a one-shot game or whether the game is to be repeated an indefinite 
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number of times. 

If the game is going to be played just one time, the strategy of defecting— 
in this example, confessing—seems to be a reasonable one. After all, what¬ 
ever the other fellow does, you are better off, and you have no way of 
influencing the other person’s behavior. 


28.5 Repeated Games 

In the preceding section, the players met only once and played the prisoner’s 
dilemma game a single time. However, the situation is different if the game 
is to be played repeatedly by the same players. In this case there are new 
strategic possibilities open to each player. If the other player chooses to 
defect on one round, then you can choose to defect on the next round. Thus 
your opponent can be “punished” for “bad” behavior. In a repeated game, 
each player has the opportunity to establish a reputation for cooperation, 
and thereby encourage the other player to do the same. 

Whether this kind of strategy will be viable depends on whether the 
game is going to be played a fixed number of times or an indefinite number 
of times. 

Let us consider the first case, where both players know that the game 
is going to be played 10 times, say. What will the outcome be? Suppose 
we consider round 10. This is the last time the game will be played, by 
assumption. In this case, it seems likely that each player will choose the 
dominant strategy equilibrium, and defect. After all, playing the game for 
the last time is just like playing it once, so we should expect the same 
outcome. 

Now consider what will happen on round 9. We have just concluded that 
each player will defect on round 10. So why cooperate on round 9? If you 
cooperate, the other player might as well defect now and exploit your good 
nature. Each player can reason the same way, and thus each will defect. 

Now consider round 8. If the other person is going to defect on round 
9 ... and so it goes. If the game has a known, fixed number of rounds, 
then each player will defect on every round. If there is no way to enforce 
cooperation on the last round, there will be no way to enforce cooperation 
on the next to the last round, and so on. 

Players cooperate because they hope that cooperation will induce further 
cooperation in the future. But this requires that there will always be the 
possibility of future play. Since there is no possibility of future play in 
the last round, no one will cooperate then. But then why should anyone 
cooperate on the next to the last round? Or the one before that? And so 
it goes—the cooperative solution “unravels” from the end in a prisoner’s 
dilemma with a known, fixed number of plays. 

But if the game is going to be repeated an indefinite number of times, 
then you do have a way of influencing your opponent’s behavior: if he 
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refuses to cooperate this time, you can refuse to cooperate next time. As 
long as both parties care enough about future payoffs, the threat of non¬ 
cooperation in the future may be sufficient to convince people to play the 
Pareto efficient strategy. 

This has been demonstrated in a convincing way in a series of experi¬ 
ments run by Robert Axelrod. 3 He asked dozens of experts on game theory 
to submit their favorite strategies for the prisoner’s dilemma and then ran 
a “tournament” on a computer to pit these strategies against each other. 
Every strategy was played against every other strategy on the computer, 
and the computer kept track of the total payoffs. 

The winning strategy—the one with the highest overall payoff -turned 
out to be the simplest strategy. It is called “tit for tat” and goes like this. 
On the first round, you cooperate—play the “deny” strategy. On every 
round thereafter, if your opponent cooperated on the previous round, you 
cooperate. If your opponent defected on the previous round, you defect. 
In other words, do whatever the other player did in the last round. 

The tit-for-tat strategy does very well because it offers an immediate 
punishment for defection. It is also a forgiving strategy: it punishes the 
other player only once for each defection. If he falls into line and starts to 
cooperate, then tit for tat will reward the other player with cooperation. 
It appears to be a remarkably good mechanism for achieving the efficient 
outcome in a prisoner’s dilemma that will be played an indefinite number 
of times. 


28.6 Enforcing a Cartel 

In Chapter 27 we discussed the behavior of duopolists playing a price¬ 
setting game. We argued there that if each duopolist could choose his 
price, then the equilibrium outcome would be the competitive equilibrium. 
If each firm thought that the other firm would keep its price fixed, then 
each firm would find it profitable to undercut the other. The only place 
where this would not be true was if each firm were charging the lowest 
possible price, which in the case we examined was a price of zero, since 
the marginal costs were zero. In the terminology of this chapter, each firm 
charging a zero price is a Nash equilibrium in pricing strategies—what we 
called a Bertrand equilibrium in Chapter 27. 

The payoff matrix for the duopoly game in pricing strategies has the 
same structure as the prisoner’s dilemma. If each firm charges a high price, 
then they both get large profits. This is the situation where they are both 
cooperating to maintain the monopoly outcome. But if one firm is charging 


3 Robert Axelrod is a political scientist from the University of Michigan. For an ex¬ 
tended discussion, see his book The Evolution of Cooperation (New York: Basic 
Books, 1984). 
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a high price, then it will pay the other firm to cut its price a little, capture 
the other fellow’s market, and thereby get even higher profits. But if both 
firms cut their prices, they both end up making lower profits. Whatever 
price the other fellow is charging, it will always pay you to shave your price 
a little bit. The Nash equilibrium occurs when each fellow is charging the 
lowest possible price. 

However, if the game is repeated an indefinite number of times, there 
may be other possible outcomes. Suppose that you decide to play tit for 
tat. If the other fellow cuts his price this week, you will cut yours next 
week. If each player knows that the other player is playing tit for tat, 
then each player would be fearful of cutting his price and starting a price 
war. The threat implicit in tit for tat may allow the firms to maintain high 
prices. 

Real-life cartels sometimes appear to employ tit-for-tat strategies. For 
example, the Joint Executive Committee was a famous cartel that set the 
price of railroad freight in the United States in the late 1800s. The forma¬ 
tion of this cartel preceded antitrust regulation in the United States, and 
at the time was perfectly legal. 4 

The cartel determined what market share each railroad could have of 
the freight shipped. Each firm set its rates individually, and the JEC kept 
track of how much freight each firm shipped. However, there were several 
occasions during 1881, 1884, and 1885 where some members of the cartel 
thought that other member firms were cutting rates so as to increase their 
market share, despite their agreement. During these periods, there were 
often price wars. When one firm tried to cheat, all firms would cut their 
prices so as to “punish” the defectors. This kind of tit-for-tat strategy was 
apparently able to support the cartel arrangement for some time. 


EXAMPLE: Tit for Tat in Airline Pricing 

Airline pricing provides an interesting example of tit-for-tat behavior. Air¬ 
lines often offer special promotional fares of one sort or another; many 
observers of the airline industry claim that these promotions can be used 
to signal competitors to refrain from cutting prices on key routes. 

A senior director of marketing for a major U.S. airline described a case in 
which Northwest lowered fares on night flights from Minneapolis to various 
West Coast cities in an effort to fill empty seats. Continental Airlines 
interpreted this as an attempt to gain market share at its expense and 
responded by cutting all its Minneapolis fares to Northwest’s night-fare 


4 For a detailed analysis, see Robert Porter, “A Study of Cartel Stability: the Joint 
Executive Committee, 1880-1886,” The Bell Journal of Economics , 14, 2 (Autumn 
1983), 301-25. 
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level. However, the Continental fare cuts were set to expire one or two 
days after they were introduced. 

Northwest interpreted this as a signal from Continental that it was not 
serious about competing in this market, but simply wanted Northwest to 
retract its night-fare cuts. But Northwest decided to send a message of its 
own to Continental: it instituted a set of cheap fares to the West Coast 
for its flights departing from Houston, Continental’s home base! North¬ 
west thereby signaled that it felt its cuts were justified, while Continental’s 
response was inappropriate. 

All these fare cuts had very short expiration dates; this feature seems to 
indicate that they were meant more as messages to the competition than as 
bids for larger market share. As the analyst explained, fares that an airline 
doesn’t want to offer “should almost always have an expiration date on 
them in the hopes that the competition will eventually wake up and match.” 

The implicit rules of competition in duopoly airline markets seem to be 
the following: if the other firm keeps its prices high, I will maintain my 
high prices; but if the other firm cuts its prices, I will play tit for tat and 
cut my prices in response. In other words, both firms “live by the Golden 
Rule”: do unto others as you would have them do unto you. This threat 
of retaliation then serves to keep all prices high. 5 


28.7 Sequential Games 


Up until now we have been thinking about games in which both players 
act simultaneously. But in many situations one player gets to move first, 
and the other player responds. An example of this is the Stackelberg model 
described in Chapter 27, where one player is a leader and the other player 
is a follower. 

Let’s describe a game like this. In the first round, player A gets to choose 
top or bottom. Player B gets to observe the first player’s choice and then 
chooses left or right. The payoffs are illustrated in a game matrix in Table 
28.5. 

Note that when the game is presented in this form it has two Nash 
equilibria: (top, left) and (bottom, right). However, we’ll show below that 
one of these equilibria isn’t really reasonable. The payoff matrix hides the 
fact that one player gets to know what the other player has chosen before 
he makes his choice. In this case it is more useful to consider a diagram 
that illustrates the asymmetric nature of the game. 

Figure 28.1 is a picture of the game in extensive form -a way to rep¬ 
resent the game that shows the time pattern of the choices. First, player A 

5 Facts taken from A. Nomani, “Fare Warning: How Airlines Trade Price Plans,” Wall 

Street Journal, October 9, 1990, Bl. 
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The payoff matrix of a sequential game. 



Player B 

Left 

Right 

Player A 

Top 

1,9 

1 , 9 

Bottom 

0 , 0 

2, 1 


Table 

28.5 



A, 

1 , 


1 , 


0 , 


2 , 


Extensive form of the game. This way of depicting a game 
indicates the order in which the players move. 


Figure 

28.1 


has to choose top or bottom, and then player B has to choose left or right. 
But when B makes his choice, he will know what A has done. 

The way to analyze this game is to go to the end and work backward. 
Suppose that player A has already made his choice and we are sitting in 
one branch of the game tree. If player A has chosen top, then it doesn’t 
matter what player B does, and the payoff is (1,9). If player A has chosen 
bottom, then the sensible thing for player B to do is to choose right, and 
the payoff is (2,1). 

Now think about player A’s initial choice. If he chooses top, the outcome 
will be (1,9) and thus he will get a payoff of 1. But if he chooses bottom, he 
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gets a payoff of 2. So the sensible thing for him to do is to choose bottom. 
Thus the equilibrium choices in the game will be (bottom, right), so that 
the payoff to player A will be 2 and to player B will be 1. 

The strategies (top, left) are not a reasonable equilibrium in this sequen¬ 
tial game. That is, they are not an equilibrium given the order in which 
the players actually get to make their choices. It is true that if player A 
chooses top, player B could choose left— but it would be silly for player A 
to ever choose top! 

From player B’s point of view this is rather unfortunate, since he ends 
up with a payoff of 1 rather than 9! What might he do about it? 

Well, he can threaten to play left if player A plays bottom. If player A 
thought that player B would actually carry out this threat, he would be 
well advised to play top. For top gives him 1, while bottom—if player B 
carries out his threat—will only give him 0. 

But is this threat credible? After all, once player A makes his choice, 
that’s it. Player B can get either 0 or 1, and he might as well get 1. Unless 
player B can somehow convince player A that he will really carry out his 
threat—even when it hurts him to do so—he will just have to settle for the 
lower payoff. 

Player B’s problem is that once player A has made his choice, player A 
expects player B to do the rational thing. Player B would be better off if 
he could commit himself to play left if player A plays bottom. 

One way for B to make such a commitment is to allow someone else to 
make his choices. For example, B might hire a lawyer and instruct him 
to play left if A plays bottom. If A is aware of these instructions, the 
situation is radically different from his point of view. If he knows about 
B’s instructions to his lawyer, then he knows that if he plays bottom he 
will end up with a payoff of 0. So the sensible thing for him to do is to play 
top. In this case B has done better for himself by limiting his choices. 


28.8 A Game of Entry Deterrence 

In our examination of oligopoly we took the number of firms in the industry 
as fixed. But in many situations, entry is possible. Of course, it is in the 
interest of the firms in the industry to try to prevent such entry. Since 
they are already in the industry, they get to move first and thus have an 
advantage in choosing ways to keep their opponents out. 

Suppose, for example, that we consider a monopolist who is facing a 
threat of entry by another firm. The entrant decides whether or not to 
come into the market, and then the incumbent decides whether or not to 
cut its price in response. If the entrant decides to stay out, it gets a payoff 
of 1 and the incumbent gets a payoff of 9. 

If the entrant decides to come in, then its payoff depends on whether 
the incumbent fights—by competing vigorously—or not. If the incumbent 
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fights, then we suppose that both players end up with 0. On the other 
hand, if the incumbent decides not to fight, we suppose that the entrant 
gets 2 and the incumbent gets 1. 

Note that this is exactly the structure of the sequential game we studied 
earlier, and thus it has a structure identical to that depicted in Figure 28.1. 
The incumbent is player B, while the potential entrant is player A. The 
top strategy is to stay out, and the bottom strategy is to enter. The left 
strategy is to fight and the right strategy is not to fight. As we’ve seen 
in this game, the equilibrium outcome is for the potential entrant to enter 
and the incumbent not to fight. 


ENTRANT, 



The new entry game. This figure depicts the entry game 
with the changed payoffs. 


Figure 

28.1 


The incumbent’s problem is that he cannot precommit himself to fighting 
if the other firm enters. If the other firm enters, the damage is done and 
the rational thing for the incumbent to do is to live and let live. Insofar as 
the potential entrant recognizes this, he will correctly view any threats to 
fight as empty. 

But suppose that the incumbent can purchase some extra production 
capacity that will allow him to produce more output at his current marginal 
cost. Of course, if he remains a monopolist, he won’t want to actually use 
this capacity since he is already producing the profit-maximizing monopoly 
output. 
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But, if the other firm enters, the incumbent will now be able to produce 
so much output that he may well be able to compete much more successfully 
against the new entrant. By investing in the extra capacity, he will lower 
his costs of fighting if the other firm tries to enter. Let us assume that if 
he purchases the extra capacity and if he chooses to fight, he will make a 
profit of 2. This changes the game tree to the form depicted in . 

Now, because of the increased capacity, the threat of fighting is credible. 
If the potential entrant comes into the market, the incumbent will get a 
payoff of 2 if he fights and 1 if he doesn’t; thus the incumbent will rationally 
choose to fight. The entrant will therefore get a payoff of 0 if he enters, and 
if he stays out he will get a payoff of 1. The sensible thing for the potential 
entrant to do is to stay out. 

But this means that the incumbent will remain a monopolist and never 
have to use his extra capacity! Despite this, it is worthwhile for the mo¬ 
nopolist to invest in the extra capacity in order to make credible the threat 
of fighting if a new firm tries to enter the market. By investing in “excess” 
capacity, the monopolist has signaled to the potential entrant that he will 
be able to successfully defend his market. 


Summary 

1. A game can be described by indicating the payoffs to each of the players 
for each configuration of strategic choices they make. 

2. A dominant strategy equilibrium is a set of choices for which each 
player’s choices are optimal regardless of what the other players choose. 

3. A Nash equilibrium is a set of choices for which each player’s choice is 
optimal, given the choices of the other players. 

4. The prisoner’s dilemma is a particular game in which the Pareto efficient 
outcome is strategically dominated by an inefficient outcome. 

5. If a prisoner’s dilemma is repeated an indefinite number of times, then 
it is possible that the Pareto efficient outcome may result from rational 
play. 

6. In a sequential game, the time pattern of choices is important. In these 
games, it can often be advantageous to find a way to precommit to a 
particular line of play. 
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REVIEW QUESTIONS 


1. Consider the tit-for-tat strategy in the repeated prisoner’s dilemma. 
Suppose that one player makes a mistake and defects when he meant to 
cooperate. If both players continue to play tit for tat after that, what 
happens? 

2. Are dominant strategy equilibria always Nash equilibria? Are Nash 
equilibria always dominant strategy equilibria? 

3. Suppose your opponent is not playing her Nash equilibrium strategy. 
Should you play your Nash equilibrium strategy? 

4. We know that the single-shot prisoner’s dilemma game results in a domi¬ 
nant Nash equilibrium strategy that is Pareto inefficient. Suppose we allow 
the two prisoners to retaliate after their respective prison terms. Formally, 
what aspect of the game would this affect? Could a Pareto efficient outcome 
result? 

5. What is the dominant Nash equilibrium strategy for the repeated pris¬ 
oner’s dilemma game when both players know that the game will end after 
one million repetitions? If you were going to run an experiment with hu¬ 
man players for such a scenario, would you predict that players would use 
this strategy? 

6. Suppose that player B rather than player A gets to move first in the 
sequential game described in this chapter. Draw the extensive form of the 
new game. What is the equilibrium for this game? Does player B prefer to 
move first or second? 


CHAPTER 



GAME 

APPLICATIONS 


In the last chapter we described a number of important concepts in game 
theory and illustrated them using a few examples. In this chapter we 
examine four important issues in game theory—cooperation, competition, 
coexistence, and commitment—and see how they work in various strategic 
interactions. 

In order to do this, we first develop an important analytic tool, best 
response curves, which can be used to solve for equilibria in games. 


29.1 Best Response Curves 

Consider a two-person game, and put yourself in the position of one of the 
players. For any choice the other player can make, your best response 
is the choice that maximizes your payoff. If there are several choices that 
maximize your payoff, then your best response will be the set of all such 
choices. 

For example, consider the game depicted in Table 29.1, which we used to 
illustrate the concept of a Nash equilibrium. If the column player chooses 
left, row’s best response is to choose top; if column chooses right, then 
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A simple game 



Column 

Left 

Right 

Row 

Top 

2, 1 

0 , 0 

Bottom 

0 , 0 

1,2 


Table 

29.1 


row’s best response is to choose bottom. Similarly, the best responses for 
column are to play left in response to top and to play right in response to 
bottom. 

We can write this out in a little table: 


Column’s choice: Left Right 

Row’s best response: Top Bottom 

Row’s choice: Top Bottom 

Column’s best response: Left Right 


Notice that if column thinks that row will play top, then column will 
want to play left, and if row thinks that column will play left, row will 
want to play top. So the pair of choices (top, left) are mutually consistent 
in the sense that each player is making an optimal response to the other 
player’s choice. 

Consider a general two-person game in which row has choices r\,.... Vr 
and column has choices ci,... ,cc- For each choice r that row makes, let 
b c (r) be a best response for column, and for each choice c that column 
makes, let b r (c ) be a best response for row. Then a Nash equilibrium is 
a pair of strategies (r*,c*) such that 

c* = b c (r*) 
r* = b r (c*). 

The concept of Nash equilibrium formalizes the idea of “mutual consis¬ 
tency.” If row expects column to play left, then row will choose to play top, 
and if column expects row to play top, column will want to play left. So it 
is the beliefs and the actions of the players that are mutually consistent in 
a Nash equilibrium. 

Note that in some cases one of the players may be indifferent among 
several best responses. This is why we only require that c* be one of 
column’s best responses, and r* be one of row’s best responses. If there is 
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a unique best response for each choice then the best response curves can 
be represented as best response functions. 

This way of looking at the concept of a Nash equilibrium makes it clear 
that it is simply a generalization of the Cournot equilibrium described in 
Chapter 27. In the Cournot case, the choice variable is the amount of 
output produced, which is a continuous variable. The Cournot equilibrium 
has the property that each firm is choosing its profit-maximizing output, 
given the choice of the other firm. 

The Bertrand equilibrium, also described in Chapter 27, is a Nash equi¬ 
librium in pricing strategies. Each firm chooses the price that maximizes 
its profit, given the choice that it thinks the other firm will make. 

These examples show how the best response curve generalizes the earlier 
models, and allows for a relatively simple way to solve for Nash equilibrium. 
These properties make best response curves a very helpful tool to solve for 
an equilibrium of a game. 


29.2 Mixed Strategies 


Let us use best response functions to analyze the game shown in Table 
29.2. 


Table 

29.2 


Solving for Nash equilibrium. 



Ms. Column 

Left 

Right 

Mr. Row 

Top 

2, 1 

0 , 0 

Bottom 

0 , 0 

- 

1,2 


We are interested in looking for mixed strategy equilibria as well as 
pure strategy equilibria, so we let r be the probability that row plays top, 
and (1 — r) the probability that he plays bottom. Similarly, let c be the 
probability that column plays left, and (1 — c) the probability that she plays 
right. The pure strategies occur when r and c equal 0 or 1. 

Let us calculate row’s expected payoff if he chooses probability r of play¬ 
ing top and column chooses probability c of playing left. Look at the 
following array 
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Combination 
Top, Left 
Bottom, Left 
Top, Right 
Bottom, Right 


Probability 

rc 

(1 - r)c 
r(l - c) 

(! —r)(l —c) 


Payoff to Row 
2 
0 
0 
1 


To calculate the expected payoff to row, we weight row’s payoffs in the 
third column by the probability that they occur, given in the second col¬ 
umn, and add these up. The answer is 


Row’s payoff = 2rc + (1 — r)(l — c), 
which we can multiply out to be 


Row’s payoff = 2 rc + 1 — r — c + rc. 


Now suppose that row contemplates increasing r by A?’. How will his 
payoff change? 


Apayoff to row = 2c Ar — Ar + c Ar 
= (3c — l)Ar. 

This expression will be positive when 3c > 1 and negative when 3c < 1. 
Hence, row will want to increase r whenever c > 1/3, decrease r when 
c < 1/3, and be happy with any value of 0 < r < 1 when c — 1/3. 

Similarly, the payoff to column is given by 

Column’s payoff = cr + 2(1 — c)(l — r). 

Column’s payoff will change when c changes by Ac according to 

A payoff to column = r Ac + 2r Ac — 2Ac 
= (3 r - 2)Ac. 

Hence column will want to increase c whenever r > 2/3, decrease c when 
r < 2/3, and be happy with any value of 0 < c < 1 when r = 2/3. 

We can use this information to plot the best response curves. Start with 
row. If column choses c = 0, row will want to make r as small as possible, 
so r — 0 is the best response to c = 0. This choice will continue to be the 
best response up until c = 1/3, at which point any value of r between 0 
and 1 is a best response. For all c > 1/3, the best response row can make 
is r = 1. 

These curves are depicted in Figure 29.1. It is easy to see that they cross 
in three places: (0,0), (2/3,1/3), and (1,1), which correspond to the three 
Nash equilibria of this game. Two of these strategies are pure strategies, 
and one is a mixed strategy. 
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Figure 

29.1 



r 


Best response curves. The two curves depict the best re¬ 
sponse of row and column to each other’s choices. The inter¬ 
sections of the curves are Nash equilibria. In this case there are 
three equilibria, two with pure strategies and one with mixed 
strategies. 


29.3 Games of Coordination 

Armed with the tools of the last section we can examine our first class of 
games, coordination games. These are games where the payoffs to the 
players are highest when they can coordinate their strategies. The problem, 
in practice, is to develop mechanisms that enable this coordination. 


Battle of the Sexes 

The classic example of a coordination game is the so-called battle of the 
sexes. In this game, a boy and a girl want to meet at a movie but haven’t 
had a chance to arrange which one. Alas, they forgot their cell phones, 
so they have no way to coordinate their meeting and have to guess which 
movie the other will want to attend. 

The boy wants to see the latest action flick, while the girl would rather 
go to an art film, but they would both rather go to the same movie than 
not meet up at all. Payoffs consistent with these preferences are shown in 
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The battle of the sexes. 



Girl 

Action 

Art 

Boy 

Action 

2, 1 

0 , 0 

Art 

0 , 0 

1,2 


Table 

29.3 


Table 29.3. Note the defining feature of coordination games: the payoffs 
are higher when the players coordinate their actions than when they don’t. 

What are the Nash equilibria of this game? Luckily, this is just the game 
we used in the last section to illustrate best response curves. We saw there 
that there are three equilibria: both choose action, both choose art, or each 
chooses his or her preferred choice with probability 2/3. 

Since all of these are possible equilibria, it is hard to say what will happen 
from this description alone. Generally, we would look to considerations 
outside the formal description of the game to resolve the problem. For 
example, suppose that the art film was a closer destination for one of the 
two players. Then both players might reasonably suppose that would be 
the equilibrium choice. 

When players have good reasons to believe that one of the equilibria is 
more “natural” than the others, it is called a focal point of the game. 


Prisoner's Dilemma 

The prisoner’s dilemma, which we discussed extensively in the last chapter, 
is also a coordination game. Recall the story: two prisoners can either 
confess, thereby implicating the other, or deny committing a crime. The 
payoffs are shown in Table 29.4. 

The striking feature of the prisoner’s dilemma is that confessing is a 
dominant strategy, even though coordination (both choose deny) is far su¬ 
perior in terms of the total payoff. Coordination would allow the prisoners 
to choose the best payoff, but the problem is that there is no easy way to 
make it happen in a single-shot game. 

One way out of the prisoner’s dilemma is to enlarge the game by adding 
new choices. We saw in the last chapter that an indefinitely repeated pris¬ 
oner’s dilemma game could achieve the cooperative outcome via strategies 
like tit for tat, in which players rewarded cooperation and punished lack of 
cooperation through their future actions. The extra strategic consideration 
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Table 

29.4 


The prisoner’s dilemma. 



Player B 

Confess 

Deny 

Player A 

Confess 

-3,-3 

0, -6 

Deny 

-6, 0 

- 1 , -1 


here is that refusing to cooperate today may result in extended punishment 
later on. 

Another way to “solve” the prisoner’s dilemma is to add the possibility 
of contracting. For example, both players could sign a contract saying that 
they will stick with the cooperative strategy. If either of them reneges on 
the contract, he or she will have to pay a fine or be punished in some way. 
Contracts are very helpful in achieving all sorts of outcomes, but they rely 
on the existence of a legal system that will enforce such contracts. This 
makes sense for business negotiations but is not an appropriate assumption 
in other contexts, such as military games or international negotiations. 


Assurance Games 

Consider the U.S.-U.S.S.R. arms race of the 1950s in which each country 
could build nuclear missiles or refrain from building them. The payoffs 
to these strategies might look like those shown in Table 29.5. The best 
outcome for both parties is to refrain from building the missiles, giving a 
payoff of (4, 4). But if one refrains while the other builds, the payoff will 
be 3 to the builder and 1 to the refrainer. The payoff if they both build 
missile sites is (2, 2). 

It is not hard to see that there are two pure strategy Nash equilibria, 
(refrain, refrain) and (build, build). However, (refrain, refrain) is better for 
both parties. The trouble is, neither party knows which choice the other 
will make. Before committing to refrain, each party wants some assurance 
that the other will refrain. 

One way to achieve this assurance is for one of the players to move first, 
by opening itself to inspection, say. Note that this can be unilateral, at least 
as long as one believes the payoffs in the game. If one player announces 
that it is refraining from deploying nuclear missiles and gives the other 
player sufficient evidence of its choice, it can rest assured that the other 
player will also refrain. 
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An arms race. 



U.S.S.R. 

Refrain 

Build 

u.s. 

Refrain 

4, 4 

1, 3 

Build 

3, 1 

2, 2 


Table 

29.5 


Chicken 

Our last coordination game is based on an automobile game popularized in 
the movies. Two teenagers start at opposite ends of the street and drive in 
a straight line toward each other. The first to swerve loses face; if neither 
swerves, they both crash into each other. Some possible payoffs are shown 
in Table 29.6. 

There are two pure strategy Nash equilibria, (row swerves, column doesn’t) 
and (column swerves, row doesn’t). Column prefers the first equilibrium 
and row the second, but each equilibrium is better than a crash. Note the 
difference between this and the assurance game; there, both players were 
better off doing the same thing (building or refraining) than doing differ¬ 
ent things. Here, both players are worse off doing the same thing (driving 
straight or swerving) than if they did different things. 

Each player knows that if he can commit himself to driving straight, the 
other will chicken out. But of course, each player also knows that it would 
be crazy to crash into each other. So how can one of the players enforce 
his preferred equilibrium? 

One important strategy is commitment. Suppose that row ostentatiously 
fastened a steering wheel lock on his car before starting out. Column, 
recognizing that row now has no choice but to go straight, would choose 
to swerve. Of course if both players put on a lock, the outcome would be 
disastrous! 

How to Coordinate 

If you are a player in a coordination game, you may want to get the 
other player to cooperate at an equilibrium that you both like (the as¬ 
surance game), cooperate an an equilibrium one of you likes (battle of the 
sexes), play something other than the equilibrium strategy (the prisoner’s 
dilemma), or make a choice leading to your preferred outcome (chicken). 

In the assurance game, the battle of the sexes, and chicken, this can 
be accomplished by one player’s moving first, and committing herself to a 
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particular choice. The other player can then observe the choice and respond 
accordingly. In the prisoner’s dilemma, this strategy doesn’t work: if one 
player chooses not to confess, it is in the other’s interest to do so. Instead 
of sequential moves, repetition and contracting are major ways to “solve” 
the prisoner’s dilemma. 


Table 

29.6 


Chicken. 



Column 

Swerve 

Straight 

Row 

Swerve 

0 , 0 

- 1 , 1 

Straight 

1,-1 

-2, -2 


29.4 Games of Competition 


The opposite pole from cooperation is competition. This is the famous case 
of zero-sum games, so called because the payoff to one player is equal to 
the losses of the other. 

Most sports are effectively zero-sum games: a point awarded to one team 
is equivalent to a point subtracted from the other team. Competition is 
fierce in such games because the players’ interests are diametrically op¬ 
posed. 

Let us illustrate a zero-sum game by looking at soccer, known as football 
in most of the world. Row is kicking a penalty shot and column is defending. 
Row can kick to the left or kick to the right; column can favor one side and 
defend to the left or defend to the right in order to deflect the kick. 

We will express the payoffs to these strategies in terms of expected points. 
Obviously row will be more successful if column jumps the wrong way. On 
the other hand, the game may not be perfectly symmetric since row may be 
better at kicking in one direction than another and column may be better 
at defending one direction or the other. 

Let us assume that row will score 80 percent of the time if he kicks to 
the left and column jumps to the right but only 50 percent of the time if 
column jumps to the left. If row kicks to the right, we will assume that he 
succeeds 90 percent of the time if column jumps to the left but 20 percent 
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Penalty point in soccer. 



Column 

Defend left 

Defend right 

Row 

Kick left 

50, -5 

OO 

O 

1 

CO 

o 

Kick right 

90, 

20, -20 


Table 

29.7 


of the time if column jumps to the right. These payoffs are illustrated in 
Table 29.7. 

Note that the payoffs in each entry sum to zero, indicating that the play¬ 
ers have diametrically opposed goals. Row wants to maximize his expected 
payoff, and column wants to maximize her expected payoff—which means 
she wants to minimize row’s payoff. 

Obviously, if column knows which way row will kick she will have a 
tremendous advantage. Row, recognizing this, will therefore try to keep 
column guessing. In particular, he will kick sometimes to his strong side 
and sometimes to his weak side. That is, he will pursue a mixed strategy. 

If row kicks left with probability p , he will get an expected payoff of 
50 p + 90(1 — p) when column jumps left and 80p + 20(1 — p) when column 
jumps right. Row wants to make this expected payoff as big as possible, 
and column wants to make it as small as possible. 

For example, suppose that row chooses to kick left half the time. If 
column jumps left, row will have an expected payoff of 50 x 1/2 + 90 x 
1/2 = 70, and if column jumps right, row will have an expected payoff of 
80 x 1/2 + 20 x 1/2 = 50. 

Column, of course, can carry through this same reasoning. If column 
believes that row will kick to the left half the time, then column will want 
to jump to the right, since this is the choice that minimizes row’s expected 
payoff (thereby maximizing column’s expected payoff). 

Figure 29.2 shows row’s expected payoffs for different choices of p. This 
simply involves graphing the two functions 50p+90(l— p) and 80p+20(l— p). 
Since these two expressions are linear functions of p , the graphs are straight 
lines. 

Row recognizes that column will always try to minimize his expected 
payoff. Thus, for any p , the best payoff he can hope for is the minimum 
of the payoffs given by the two strategies. We’ve illustrated this by the 
colored line in Figure 29.2. 

Where does the maximum of these minimum payoffs occur? Obviously, 
it occurs at the peak of the colored line, or, equivalently, where the two 
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Figure 

29.2 



Row’s strategy. The two curves show row’s expected payoff 
as a function of p, the probability that he kicks to the left. 
Whatever p he chooses, column will try to minimize row’s payoff. 


lines intersect. We can calculate this value algebraically by solving 
50 p + 90(1 -p) = 80 p + 20(1 - p) 

for p. You should verify that the solution is p = .7. 

Hence, if row kicks to the left 70 percent of the time and column responds 
optimally, row will have an expected payoff of 50 x .7 + 90 x .3 = 62. 

What about column? We can perform a similar analysis for her choices. 
Suppose column decides to jump to the left with probability q and jump 
to the right with probability (1 — q). Then row’s expected payoff will be 
50g + 80(1 — q) if column jumps to the left and 90g + 20(1 — q) if column 
jumps to the right. For each q, column will want to minimize row’s payoff. 
But column recognizes that row wants to maximize this same payoff. 

Hence, if column chooses to jump to the left with probability 1/2, she 
recognizes that row will get an expected payoff of 50 x 1/2 + 80 x 1/2 = 65 
if row kicks left and 90 x 1/2 + 20 x 1/2 = 55 if row kicks right. In this 
case row will, of course, choose to kick left. 

We can plot the two payoffs in Figure 29.3, which is analogous to the 
previous diagram. From column’s viewpoint, it is the maximum of the two 
lines that is relevant, since this reflects row’s optimal choice for each choice 








GAMES OF COMPETITION 549 


of q. Hence, the diagram depicts these lines in color. Just as before we 
can find the best q for column—the point where row’s maximum payoff is 
minimized. This occurs where 

50g + 80(1 - q) = 90g + 20(1 - q), 
which implies q = .6. 



Column’s strategy. The two lines show row’s expected payoff 
as a function of q , the probability that column jumps to the left. 
Whatever q column chooses, row will try to maximize his own 
payoff. 


We have now calculated the equilibrium strategies for each of the two 
players. Row should kick to the left with probability .7, and column should 
jump to the left with probability .6. These values were chosen so that 
row’s payoffs and column’s payoffs will be the same, whatever the other 
player does, since we found the values by equating the payoffs from the two 
strategies the opposing player could choose. 

So when row chooses .7, column is indifferent between jumping left and 
jumping right, or, for that matter jumping left with any probability q. In 
particular, column is perfectly happy jumping left with probability .6. 

Similarly, if column jumps left with probability .6, then row is indiffer¬ 
ent between kicking left and kicking right, or any mixture of the two. In 


Figure 

29.3 
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Figure 

29.4 


particular, he is happy to kick left with probability .7. Hence these choices 
are a Nash equilibrium: each player is optimizing, given the choices of the 
other. 

In equilibrium row scores 62 percent of the time and fails to score 38 
percent of the time. This is the best he can do, if the other player responds 
optimally. 

What if column responds nonoptimally? Can row do better? To answer 
this question, we can use the best response curves introduced at the be¬ 
ginning of this chapter. We have already seen that when p is less than .7, 
column will want to jump left, and when p is greater than .7, column will 
want to jump right. Similarly when q is less than .6, row will want to kick 
left, and when q is greater than .6, row will want to kick right. 

Figure 29.4 depicts these best response curves. Note that they intersect 
at the point where p = .7 and q = .6. The nice thing about the best 
response curves is that they tell each player what to do for every choice 
the other player makes, optimal or not. The only choice that is an optimal 
response to an optimal choice is where the two curves cross—the Nash 
equilibrium. 



P 

Best response curves. These are the best response curves 
for row and column, as a function of p, the probability that row 
kicks to the left, and q , the probability that column jumps to 
the left. 
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29.5 Games of Coexistence 

We have interpreted mixed strategies as randomization by the players. In 
the penalty kick game, if row’s strategy is to play left with probability .7 
and right with probability .3, then we think that row will “mix it up” and 
play left 70 percent of the time and right 30 percent of the time. 

But there is another interpretation. Suppose that kickers and goalies are 
matched up at random and that 70 percent of the kickers always kick left 
and 30 percent always kick right. Then, from the goalie’s point of view, it 
is just like facing a single player who randomizes with those probabilities. 

This isn’t all that compelling as a story for soccer games, but it is a 
reasonable story for animal behavior. The idea is that various kinds of be¬ 
havior are genetically programmed and that evolution selects the mixtures 
of the population that are stable with respect to evolutionary forces. In re¬ 
cent years, biologists have come to regard game theory as an indispensible 
tool to study animal behavior. 

The most famous game of animal interaction is the hawk-dove game. 
This doesn’t refer to a game between hawks and doves (which would have a 
pretty predictable outcome) but rather to a game involving a single species 
that exhibits two kinds of behavior. 

Think of a wild dog. When two wild dogs come across a piece of food, 
they have to decide whether to fight or to share. Fighting is the hawkish 
strategy: one will win and one will lose. Sharing is a dovish strategy: it 
works well when the other player is also dovish, but if the other player 
is hawkish, the offer to share is rejected and the dovish player will get 
nothing. 

A possible set of payoffs is given in Table 29.8. 


Hawk-dove game. 



Column 

Hawk 

Dove 

Row 

Hawk 

-2, -2 

4, 0 

Dove 

0, 4 

2, 2 


Table 

29.8 


If both wild dogs play dove, they end up with (2, 2). If one plays hawk 
and the other plays dove, the hawkish player wins everything. But if both 
play hawk, each dog will be seriously injured. 
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It obviously can’t be an equilibrium if everyone plays hawk, since if some 
dog played dove, it would end up with 0 rather than —2. And if all dogs 
played dove, it would pay someone to deviate and play hawk. So there 
will have to be some mixture of hawk types and dove types in equilibrium. 
What sort of mixture should we expect? 

Suppose that the fraction playing hawk is p. Then a hawk will meet 
another hawk with probability p and meet a dove with probability 1 — p. 
The expected payoff to the hawk type will be 

H = —2p + 4(1 — p). 

The expected payoff to the dove type will be 

D = 2(1 — p). 

Suppose that the type that has the higher payoff reproduces more rapidly, 
passing its tendency to play hawk or dove on to its offspring. So if H > D, 
we would see the fraction of hawk types in the population increase, and if 
H < D, we would expect to see the number of dove types increase. 

The only way the population can be in equilibrium is if the payoffs to 
each type are the same. This requires 

H = -2p + 4(1 -p) = 2(1 - p) = D, 
which solves for p = 1/2. 

We have found that a 50-50 mixture of doves and hawks is an equilibrium. 
Is it stable, in some sense? We plot the payoffs to hawk and dove as a 
function of p : the fraction of the population playing hawk in Figure 29.5. 
Note that when p > 1/2, the payoff to playing hawk is less than that of 
playing dove, so we would expect to see the doves reproduce more rapidly, 
moving us back to the equilibrium 50-50 ratio. Similarly, when p < 1/2, 
the payoff to hawk is greater than the payoff to dove, leading the hawks to 
reproduce more rapidly. 

This argument shows that not only is p = 1/2 an equilibrium but it 
is also stable under evolutionary forces. Considerations of this sort lead 
to a concept known as an evolutionarily stable strategy or an ESS . 1 
Remarkably, an ESS turns out to be a Nash equilibrium, even though it 
was derived from quite different considerations. 

The Nash equilibrium concept was designed to deal with calculating, 
rational individuals, each of whom is trying to devise a strategy appropriate 
for the best strategy the other player might choose. The ESS was designed 
to model animal behavior under evolutionary forces, where strategies that 
had greater fitness payoffs would reproduce more rapidly. But the ESS 
equilibria are also Nash equilibria, giving another argument for why this 
particular concept in game theory is so compelling. 

1 See John Maynard Smith, Evolution and the Theory of Games , (Cambridge Univer¬ 
sity Press, 1982). 
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FRACTION PLAYING HAWK 


Payoffs in the hawk-dove game. The payoff to hawk is 
depicted in color; the payoff to dove is in black. When p > 1/2, 
the payoff to hawk is less than dove and vice versa, showing that 
the equilibrium is stable. 


29.6 Games of Commitment 

The previous examples involving games of cooperation and competition 
have been concerned with games with simultaneous moves. Each player 
had to make his or her choice without knowing what the other player was 
choosing (or had chosen). Indeed, games of coordination or competition 
can be quite trivial if one player knows the other’s choices. 

In this section we turn our attention to games with sequential moves. 
An important strategic issue that arises in such games is commitment. 
To see how this works, look back at the game of chicken described earlier 
in this chapter. We saw there that if one player could force himself to 
choose straight, the other player would optimally choose to swerve. In the 
assurance game, the outcome would be better for both players if one of 
them moved first. 

Note that this committed choice must be both irreversible and observable 
by the other player. Irreversibility is part of what it means to be committed, 
while observability is crucial if the other player is going to be persuaded to 
change his or her behavior. 


Figure 

29.5 


The Frog and the Scorpion 

We begin with the fable of the frog and the scorpion. They were standing 
on the bank of the river, trying to figure out a way across. “I know,” said 
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Figure 

29.6 


the scorpion “I will climb on your back and you can swim across the river.” 
The frog said, “But what if you sting me with your stinger?” The scorpion 
said, “Why would I do that? Then we would both die.” 

The frog found this convincing, so the scorpion climbed on his back and 
they started across the river. Halfway across, at the deepest point, the 
scorpion stung the frog. Writhing in pain, the frog cried out, “Why did 
you do that? Now we are both doomed!” “Alas,” said the scorpion, as he 
sank into the river, “it is my nature.” 

Let’s look at the frog and the scorpion from the viewpoint of game theory. 
Figure 29.6 depicts a sequential game with payoffs consistent with the story. 
Start at the bottom of the game tree. If the frog refuses the scorpion, 
both get nothing. Looking up one line, we see that if the frog carries the 
scorpion, he receives utility 5, for doing a good deed, and the scorpion 
receives a payoff of 3, for getting across the river. In line where the frog 
is stung, he receives a payoff of —10, and the scorpion gets a payoff of 5, 
representing the satisfaction from fulfilling his natural instincts. 


FROG, 



The frog and the scorpion. If the frog chooses to carry the 
scorpion, the scorpion will choose to sting him and both will 
die. 


It is best to start with the final move of the game: the scorpion’s choice 
of sting or refrain. Stinging has a higher payoff to the scorpion because 
“it is his nature” to sting. Hence the frog should rationally choose to 
refuse to carry the scorpion. Unfortunately, the frog didn’t understand 
the scorpion’s payoffs; apparently, he thought that the scorpion’s payoffs 
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looked something like those in Figure 29.7. Alas, this mistake was fatal for 
the frog. 

A smart frog would figure out some way to make the scorpion commit to 
not stinging. He could, for example, tie his tail. Or he could hire a hit frog, 
who would retaliate against the scorpion’s family. Whatever the strategy, 
the critical thing for the frog to do is to change the payoffs to the scorpion 
by making stinging more costly or refraining more rewarding. 


FROG, 



The frog and the scorpion. With these payoffs, if the frog 
chooses to carry the scorpion, the scorpion will not choose to 
sting him, and both will make it across the river safely. 


The Kindly Kidnapper 

Kidnapping for ransom is a big business in some parts of the world. In 
Columbia, it is estimated that there are over 2,000 kidnappings for ransom 
per year. In the former Soviet Union, kidnappings rose from 5 in 1992 to 
105 in 1999. Many of the victims are Western businesspeople. 

Some countries, such as Italy, have laws against paying ransom. The 
reasoning is that if the victim’s family or employers can commit themselves 
not to pay ransom, then the kidnappers will have no motive to abduct the 
victim in the first place. 

The problem is, of course, once a kidnapping has taken place, a victim’s 
family will prefer to pay the kidnappers, even if it is illegal to do so. Hence 
penalties for paying ransom may not be effective as a commitment device. 


Figure 

29.7 
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Figure 

29.8 


Suppose some kidnappers abduct a hostage and then discover that they 
can’t get paid. Should they release the hostage? The hostage, of course, 
promises not to reveal the identity of the kidnappers. But will he keep 
this promise? Once he is released, he has no incentive to do so—and every 
incentive to try to punish the kidnappers. Even if the kidnappers want to 
let the hostage go, they can’t do so for fear of being identified. 

Figure 29.8 depicts some possible payoffs. The kidnapper would feel bad 
about killing the hostage, receiving a payoff of —3. Of course, the hostage 
would feel even worse, receiving a payoff of —10. If the hostage is released, 
and refrains from identifying the kidnapper, the hostage gets a payoff of 3 
and the kidnapper gets a payoff of 5. But if the hostage does identify the 
kidnapper, he gets a payoff of 5, leaving the kidnapper with a payoff of —5. 


KIDNAPPER, 



Kidnap game. The kidnapper would like to release the hostage, 
but if he does, the hostage will identify him. 


Now it is the hostage who has the commitment problem: how can he 
convince the kidnappers that he won’t renege on his promise and reveal 
their identity? 

The hostage needs to figure out a way to change the payoffs of the game. 
In particular, he needs to find a way to impose a cost on himself if he 
identifies the kidnappers. 

Thomas Schelling, an economist at the University of Maryland who has 
worked extensively on strategic analysis in dynamic games, suggests that 
the hostage might have the kidnappers photograph him in some embarrass¬ 
ing act and leave them with the photos. This effectively changes the payoffs 
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from his subsequently revealing the identity of the kidnappers, since they 
then have the option of revealing the embarrassing photograph. 

This sort of strategy is known as an “exchange of hostages.” In the 
Middle Ages, when two kings wanted to ensure a contract wouldn’t be 
broken, they would exchange hostages such as family members. If either 
king broke the agreement, the hostages would be sacrificed. Neither wanted 
to sacrifice their family members, so each king would have an incentive to 
respect the terms of their contract. 

In the case of the kidnapping, the embarrassing photo would impose 
costs on the hostage if it were released, thereby ensuring that he will stick 
to his agreement not to reveal the identity of the kidnappers. 


When Strength Is Weakness 

Our next example comes from the world of animal psychology. It turns out 
that pigs quickly establish dominance-subordinateness relations, in which 
the dominant pig tends to boss the subordinate pig around. 

Some psychologists put two pigs, one dominant, one subordinate, in a 
long pen. 2 At one end of the pen was a lever that would release a portion 
of food to a trough located at the other end of the pen. The question of 
interest was this: which pig would push the lever and which would eat the 
food? 

Somewhat surprisingly the outcome of the experiment was that the dom¬ 
inant pig pressed the lever, while the subordinate pig waited for the food. 
The subordinate pig then ate most of the food, while the dominant pig 
rushed as fast as it could to the trough end of the pen, ending up with only 
a few scraps. Table 29.9 depicts a game that illustrates the problem. 


Pigs pressing levers. 



Dominant Pig 

Don't 
press lever 

Press lever 

Subordinate 

Pig 

Don't 
press lever 

0 , 0 

4, 1 

Press lever 

0, 5 

2, 3 


Table 

29.9 


The original reference is Baldwin and Meese, “Social Behavior in Pigs Studied by 
Means of Operant Conditioning,” (Animal Behavior, (1979)). I draw on the de¬ 
scription of John Maynard Smith, Evolution and the Theory of Games (Cambridge 
University Press, 1982). 


2 
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The subordinate pig compares a payoff of (0, 4) to (0, 2) and concludes, 
sensibly enough, that pressing the lever is dominated by not pressing it. 
Given that the subordinate pig doesn’t press the lever, the dominant pig 
has no choice but to do so. 

If the dominant pig could refrain from eating all the food and reward the 
subordinate pig for pressing the lever, it could achieve a better outcome. 
The problem is that pigs have no contracts, and the dominant pig can’t 
help being a hog! 

As in the case of the kindly kidnapper, the dominant pig has a commit¬ 
ment problem. If he could only commit to not eating all the food, he would 
end up much better off. 


Savings and Social Security 

Commitment problems aren’t limited to the animal world. They also show 
up in economic policy. 

Saving for retirement is an interesting and timely example. Everyone 
gives lip service to the fact that saving is a good idea. Unfortunately, few 
people actually do it. Part of the reason for the reluctance to save is that 
individuals recognize that society won’t let them starve, so there is a good 
chance they will be bailed out later on. 

To formulate this in a game between the generations, let’s consider two 
strategies for the older generation: save or squander. The younger genera¬ 
tion likewise has two strategies: support their elders or save for their own 
retirement. A possible game matrix is shown in Table 29.10. 


Table 

29.10 


Intergenerational conflict over savings. 



Younger Generation 

Support 

Refrain 

Older 

Generation 

Save 

3,-1 

1,0 

Squander 

2, -1 

-2, 2 


If the older generation saves and the younger generation also supports 
them, the old folks end up with a utility level of 3 and the young folks end 
up with —1. If the older generation squanders and the younger generation 
supports them, the elders end up with a utility of 2 and the young folks 
end up with — 1. 
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If the younger generation refrains from providing support to their elders 
and the older generation saves, the old folks get 1 and the young folks get 0. 
Finally, if the old folks squander and the young folks neglect them, each 
ends up with utility of —2, the old folks from starving and the young folks 
from having to watch. 

It is not hard to see that there are two Nash equilibria in this game. If 
the old folks choose to save, then the young folks will choose optimally to 
neglect them. But if the old folks choose to squander, then it is optimal 
for the younger generation to support them. And of course, given that the 
younger generation will support their elders, it is optimal for their elders 
to squander! 

However, this analysis ignores the time structure of the game: one of the 
(few) advantages of being old is that you get to move first. If we draw out 
the game tree, the payoffs become those in Figure 29.9. 



OLD, 


YOUNG 2 ' 

CHOOSE^ -- 

OLD 

CHOOSE_ 

Squander,,*'''^ 

Refrairi~'~'-~\^ 2 2 


3 ' 

Save \ 



The savings game in extended form. Knowing that the 
younger generation will support them, the older generation chooses 
to squander. The subgame perfect equilibrium is (support, 
squander). 


If the oldsters save, the youngsters will choose to neglect them, so the 
oldsters end up with a payoff of 1. If the oldsters squander, they know that 
the youngsters won’t be able to bear watching them starve, so the oldsters 
end up with a payoff of 3. Hence the sensible thing for the oldsters to do 
is to squander, knowing they will be bailed out later on. 


Figure 

29.9 
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Of course, most developed countries now have a program like the U.S. 
Social Security program that forces each generation to save for retirement. 


Hold Up 

Consider the following strategic interaction. You hire a contractor to build 
a warehouse. After the plans are approved and the construction is almost 
done, you realize that the color is bad, so you ask the contractor to change 
the paint, which involves a trivial expense. The contractor comes back and 
says: “That change order will be $1500, please.” 

You recognize that it will cost you at least that much to delay completion 
until you can find a painter, and you really do want the new color, so, 
muttering under your breath, you pay the cost. Congratulations, you have 
been held up! 

Of course, contractors are not the only party at fault in this sort of game. 
The clients can “hold up” their payment as well, causing lots of grief for 
the contractor. 

The game tree for the hold-up problem is depicted in Figure 29.10. We 
suppose that the value the owner places on having the new paint is $1500 
and that the actual cost of painting is $200. Starting at the top of leaves 
of the tree, if the contractor charges $1500, it will realize a profit of $1300, 
and the client gets a net utility of zero. 

If the client looks for another painter, it will cost him $200 to pay the 
painter and, say, $1400 in lost time. He gets the color he wants which is 
worth $1500, buy has to pay $1600 in direct costs and delay costs, leaving 
him with a net loss of $100. 

If the contractor charges the client the actual cost of $200, he breaks 
even and the client gets a $1500 value for $200, leaving him with a net 
payoff of $1300. 

As can be seen, the optimal choice for the contractor is to extort the pay¬ 
ment, and the optimal choice for the customer is to give in. But a sensible 
client will recognize that change orders will occur in any project. Because 
of this, the client will be reluctant to hire contractors with a reputation for 
extortion which is, of course, bad for the contractor. 

How do firms solve the hold-up problem? The basic answer is contracts. 
Normally, contractors negotiate a contract specifying what kinds of change 
orders are appropriate and how their costs will be determined. Sometimes 
there are even arbitration or other dispute resolution procedures built into 
the contracts. A lot of time, energy, and money goes into writing contracts 
just to make certain that hold up won’t occur. 

But contracts aren’t the only solution. Another way to solve the problem 
is through commitment. For example, the contractor might post a bond 
guaranteeing timely completion of the project. Again, there will generally 
be some objectively specified terms about what constitutes completion. 
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CONTRACTOR, 
1300, 


0 , 


0 , 


The hold-up problem. The contractor charges a high price 
for the change since the client has no alternative. 


Figure 

29.10 


Another important factor is reputation. Obviously, a contractor who 
persistently tries to extort his customers will get a bad reputation. He 
won’t be hired again by this customer, and he certainly won’t get good 
recommendations. This reputation effect can be examined in a repeated 
game context in which hold up today will cost the contractor in the future. 


29.7 Bargaining 

The classical bargaining problem is divide the dollar. Two players have a 
dollar that they want to divide between them. How do they do it? 

The problem, as stated, has no answer since there is too little information 
to construct a reasonable model. The challenge in modeling bargaining is 
to find some other dimensions on which the players can negotiate. 

One solution, the Nash bargaining model, takes an axiomatic ap¬ 
proach by specifying certain properties that a reasonable bargaining so¬ 
lution should have and then proving that there is only one outcome that 
satisfies these axioms. 

The outcome ends up depending on how risk averse the players are and 
what will happen if no bargain is made. Unfortunately, a full treatment of 
this model is beyond the scope of this book. 

An alternative approach, the Rubinstein bargaining model, looks at 
a sequence of choices and then solves for the subgame perfect equilibrium. 
Luckily the basic insight of this model is easy to illustrate in simple cases. 

Two players, Alice and Bob, have $1 to divide between them. They 
agree to spend at most three days negotiating over the division. The first 
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day, Alice will make an offer, Bob either accepts or comes back with a 
counteroffer the next day, and on the third day Alice gets to make one final 
offer. If they cannot reach an agreement in three days, both players get 
zero. 

We assume Alice and Bob differ in their degree of impatience: Alice 
discounts payoffs in the future at a rate of a per day, and Bob discounts 
payoffs at a rate of (3 per day. Finally, we assume that if a player is 
indifferent between two offers, he will accept the one that is most preferred 
by his opponent. This idea is that the opponent could offer some arbitrarily 
small amount that would make the player strictly prefer one choice and 
that this assumption allows us to approximate such an “arbitrarily small 
amount” by zero. It turns out that there is a unique subgame perfect 
equilibrium of this bargaining game. 

We start our analysis at the end of the game, right before the last day. 
At this point Alice can make a take-it-or-leave-it offer to Bob. Clearly, 
the optimal thing for Alice to do at this point is to offer Bob the smallest 
possible amount that he would accept, which, by assumption, is zero. So if 
the game actually lasts three days, Alice would get $1 and Bob would get 
zero (i.e., an arbitrarily small amount). 

Now go back to the previous move, when Bob gets to propose a division. 
At this point Bob should realize that Alice can guarantee herself $1 on the 
next move by simply rejecting his offer. A dollar next period is worth a 
to Alice this period, so any offer less than a would be sure to be rejected. 
Bob certainly prefers 1 — a now to zero next period, so he should rationally 
offer a to Alice, which Alice will then accept. So if the game ends on the 
second move, Alice gets a and Bob gets 1 — a. 

Now move to the first day. At this point Alice gets to make the offer and 
he realizes that Bob can get 1 — a if he simply waits until the second day. 
Hence Alice must offer a payoff that has at least this present value to Bob 
in order to avoid delay. Thus she offers (3(1 — a) to Bob. Bob finds this 
(just) acceptable and the game ends. The final outcome is that the game 
ends on the first move with Alice receiving 1 — (3( 1 — a) and Bob receiving 
(3(1 - a). 

The first panel in Figure 29.11 illustrates this process for the case where 
a = (3 < 1. The outermost diagonal line shows the possible payoff patterns 
on the first day, namely, all payoffs of the form xa + xb = 1. The next 
diagonal line moving toward the origin shows the present value of the pay¬ 
offs if the game ends in the second period: xa + Xb = ot. The diagonal 
line closest to the origin shows the present value of the payoffs if the game 
ends in the third period; the equation for this line is xa + Xb = a 2 . The 
right-angled path depicts the minimum acceptable divisions each period, 
leading up to the final subgame perfect equilibrium. The second panel in 
Figure 29.11 shows how the same process might look with more stages in 
the negotiation. 

It is natural to let the horizon go to infinity and ask what happens in the 
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A bargaining game. The heavy line connects together the 
equilibrium outcomes in the subgames. The point on the line 
that is furthest out is the subgame perfect equilibrium. 


infinite game. It turns out that the subgame perfect equilibrium division 
is 

Payoff to Alice = 

1 — ap 

Payoff to Bob = ^ ~ a ) . 

1 — ap 

Note that if a = 1 and (3 < 1, then Alice receives the entire payoff. 


The Ultimatum Game 

The Rubinstein bargaining model is so elegant that economists rushed to 
test it in the laboratory. They found, alas, that elegance does not imply 
accuracy. Naive subjects (i.e., noneconomics majors) aren’t very good at 
looking ahead more than one or two steps, if that. 

In addition, there are other factors that cause problems. To see this, 
let us examine a one-step version of the bargaining model described above. 
Alice and Bob still have $1 to divide between them. Alice proposes a 
division, and, if Bob agrees, the game ends. The question is, what should 
Alice say? 

According to the theory, she should propose something like 99 cents for 
Alice, 1 cent for Bob. Bob, figuring that 1 cent is better than nothing, 
accepts, and Alice goes home happy that she studied economics. 

Unfortunately, it doesn’t work out like that. A more likely outcome is 
that Bob, disgusted by the paltry 1 cent, says “No way,” and Alice ends 


Figure 

29.11 
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up with nothing. Alice, recognizing this possibility, will tend to sweeten 
the offer. In actual experiments, the average offer for U.S. undergraduates 
is about 45 cents, and this offer tends to be accepted most of the time. 

The offering players are behaving rationally, in the sense that the 45 cent 
offer is pretty close to maximizing the expected payoff, given the observed 
frequency of rejection. It is the receiving players who behave differently 
than the theory predicts, since they reject small offers, even though this 
makes them worse off. 

There are many proposed explanations for this. One view is that too 
small an offer violates social norms of behavior. Indeed, economists have 
found quite significant cross-cultural differences in behavior in ultimatum 
games. Another, not inconsistent view, is that receivers get some utility 
payoff from hurting the offerers, in retaliation for the small offer. After all, 
if all you are losing is a penny, the satisfaction of striking back at the other 
player is pretty attractive by comparison. We will the ultimatum game in 
more detail in the next chapter. 


Summary 

1. A player’s best response function gives the optimal choice for him as a 
function of the choices the other player(s) might make. 

2. A Nash equilibrium in a two-person game is a pair of strategies, one for 
each player, each of which is a best response to the other. 

3. A mixed strategy Nash equilibrium involves randomizing among several 
strategies. 

4. Common games of coordination are the battle of the sexes, where both 
players want to do the same thing rather than different things; the pris¬ 
oner’s dilemma, where the dominant strategy ends up hurting both players; 
the assurance game, where both players want to cooperate as long as they 
think the other will cooperate; and chicken, where players want to avoid 
doing the same thing. 

5. A two-person zero-sum game is one where the payoffs to one player are 
the negative of the payoffs to the other. 

6. Evolutionary games are concerned with outcomes that are stable under 
population reproduction. 

7. In sequential games, players move in turn. Each player therefore has to 
reason about what the other will do in response to his or her choices. 

8. In many sequential games, commitment is an important issue. Finding 
ways to force commitment to play particular strategies can be important. 
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REVIEW QUESTIONS 


1. In a two-person Nash equilibrium, each player is making a best response 
to what? In a dominant strategy equilibrium, each player is making a best 
response to what? 

2. Look at the best responses for row and column in the section on mixed 
strategies. Do these give rise to best response functions? 

3. If both players make the same choice in a coordination game, all will be 
well. 

4. The text claims that row scores 62 percent of the time in equilibrium. 
Where does this number come from? 

5. A contractor says that he intends to “low-ball the bid and make up for 
it on change orders.” What does he mean? 


CHAPTER 



BEHAVIORAL 

ECONOMICS 


The economic model of consumer choice that we have studied is simple 
and elegant, and is a reasonable starting place for many sorts of analy¬ 
sis. However, it is most definitely not the whole story, and in many cases 
a deeper model of consumer behavior is necessary to accurately describe 
choice behavior. 

The field of behavioral economics is devoted to studying how con¬ 
sumers actually make choices. It uses some of the insights from psychology 
to develop predictions about choices people will make and many of these 
predictions are at odds with the conventional economic model of “rational” 
consumers. 

In this chapter we will look at some of the most important phenomena 
that have been identified by behavioral economists, and contrast the pre¬ 
dictions of these behavioral theories with those presented earlier in this 
book. 1 


In writing this chapter, I have found Colin F. Camerer, George Loewenstein, and 
Matthew Rabin’s book Advances in Behavioral Economics, Princeton University 
Press, 2003, to be very useful, particularly the introductory survey by Camerer and 
Loewenstein. Other works will be noted as the relevant topics are discussed. 


l 
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30.1 Framing Effects in Consumer Choice 

In the basic model of consumer behavior, the choices were described in the 
abstract: red pencils or blue pencils, hamburgers and french fries, and so 
on. However, in real life, people are strongly affected by how choices are 
presented to them or framed. 

A faded pair of jeans in a thrift shop may be perceived very differently 
than the same jeans sold in an exclusive store. The decision to buy a 
stock may feel quite different than the decision to sell a stock, even if both 
transactions end up with the same portfolio. A store might sell dozens of 
copies of a book priced at $29.95, whereas the same book priced at $29.00 
would have substantially fewer sales. 

These are all examples of framing effects, and they are clearly a pow¬ 
erful force in choice behavior. Indeed, much of marketing practice is based 
on understanding and utilizing such biases in consumer choice. 


The Disease Dilemma 

Framing effects are particularly common in choices involving uncertainty. 
For example, consider the following decision problem: 2 

A serious disease threatens 600 people. You are offered a choice between 
two treatments, A and B, which will yield the following outcomes. 

Treatment A. Saving 200 lives for sure. 

Treatment B. A 1/3 chance of saving 600 lives and a 2/3 chance of saving 
no one. 

Which would you choose? Now consider the choices between these treat¬ 
ments. 


Treatment C. Having 400 people die for sure. 

Treatment D. A 2/3 chance of 600 people dying and a 1/3 chance of no 
one dying. 


Now which treatment would you choose? 


2 


A. Tversky and D. Kahneman, 1981, “The framing of decisions and the psychology 
of choice,” Science, 211, 453-458. 
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In the positive framing comparison—which describes how many people 
will live—most individuals choose A over B, but in the negative framing 
comparison most people choose D over C even though the outcomes in A-C 
and B-D are exactly the same. Apparently, framing the question positively 
(in terms of lives saved) makes a treatment much more attractive than 
framing the choice negatively (in terms of lives lost). 

Even expert decisions makers can fall into this trap. When psychologists 
tried this question on a group of physicians, 72 percent of them chose the 
safe treatment A over the risky treatment B. But when the question was 
framed negatively, only 22 percent chose the risky treatment C while 72 
percent chose the safe treatment. 

Though few of us are faced with life-or-death decisions, there are similar 
examples for more mundane choices, such as buying or selling stocks. A 
rational choice of an investment portfolio would, ideally, depend on an 
assessment of the possible outcomes of the investments rather than how 
one acquired those investments. 

For example, suppose that you are given 100 shares of stock in Concrete- 
Blocks.com (whose slogan is “We give away the blocks, you pay for packing 
and shipping”). You might be reluctant to sell shares you received as a gift 
despite the fact that you would never consider buying them yourself. 

People are often reluctant to sell losing stocks, thinking that they will 
“come back.” Maybe they will, maybe they won’t. But ultimately you 
shouldn’t let history determine your investment portfolio—the right ques¬ 
tion to ask is whether you have the portfolio choices today that you want. 


Anchoring Effects 

The hypothetical ConcreteBlocks.com example described above is related 
to the so-called anchoring effect. The idea here is that people’s choices 
can be influenced by completely spurious information. In a classic study 
the experimenter spun a wheel of fortune and pointed out the number that 
came up to a subject. 3 The subject was then asked whether the number 
of African countries in the United Nations was greater or less than the 
number on the wheel of fortune. 

After they responded, the subjects were asked for their best guess about 
how many African countries were in the United Nations. Even though the 
number shown on the wheel of fortune was obviously random, it exerted a 
significant influence on the subjects’ reported guesses. 

In a similar experimental design, MBA students were given an expensive 
bottle of wine and then asked if they would pay an amount for that bottle 
equal to the last two digits of their Social Security number. For example, 

3 D. Kahneman and A. Tversky, 1974, “Judgment under uncertainty: Heuristics and 

biases,” Science , 185: 1124-1131. 
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if the last two digits were 29, the question was “Would you pay $29 for this 
bottle of wine?” 

After answering that question, the students were asked what the maxi¬ 
mum amount is that they were willing to pay for the wine. Their answers 
to this latter question were strongly influenced by the price determined by 
the last two digits of their Social Security number. For example, those with 
Social Security digits of 50 or under were willing to pay $11.62 on average, 
while those with digits in the upper half of the distribution were willing to 
pay $19.95 on average. 

Again, these choices seem like mere laboratory games. However, there 
are very serious economic decisions that can also be influenced by minor 
variations in the way the choice is framed. 

Consider, for example, choices of pension plans. 4 

Some economists looked at data from three employers that offered au¬ 
tomatic enrollment in 401 (k) plans. Employees could opt out, but they 
had to make an explicit choice to do so. The economists found that the 
participation rate in these programs with automatic enrollment was spec¬ 
tacularly high, with over 85 percent of workers accepting the default choice 
of enrolling in the 401(k) plans. 

That’s the good news. The bad news is that almost all of these workers 
also chose the default investment, typically a money market fund with very 
low returns and a low monthly contribution. Presumably, the employers 
made the default investment highly conservative to eliminate downside risk 
and possible employee lawsuits. 

In subsequent work, these economists examined the experience at a com¬ 
pany where there was no default choice of pension plan: within a month 
of starting work, employees were required to choose either to enroll in the 
401 (k) plan or to postpone enrollment. 

By eliminating the standard default choices of non-enrollment, and of 
enrollment in a fund that had low rates of return, this “active decision” 
approach raised participation rates from 35 percent to 70 percent for newly 
hired employees. Moreover, employees who enrolled in the 401(k) plan 
overwhelmingly chose high savings rates. 

As this example illustrates, careful design of human resources benefits 
programs can make a striking difference in which programs are chosen, 
potentially having a large effect on consumer savings behavior. 

Bracketing 

People often have trouble understanding their own behavior, finding it too 
difficult to predict what they will actually choose in different circumstances. 

4 James Choi, David Laibson, Brigitte Madrian, and Andrew Metrick, “For Better 

or for Worse: Default Effects and 401(k) Savings Behavior,” NBER working paper, 

W8651, 2001. 
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For example, a marketing professor gave students a choice of six different 
snacks that they could consume in each of three successive weeks during 
class. 5 (You should be so lucky!) In one treatment, the students had to 
choose the snacks in advance; in the other treatment, they chose the snacks 
on each day then immediately consumed them. 

When the students had to choose in advance, they chose a much more 
diverse set of snacks. In fact, 64 percent chose a different snack each week in 
this treatment compared to only 9 percent in the other group. When faced 
with making the choices all at once, people apparently preferred variety 
to exclusivity. But when it came down to actually choosing, they made 
the choice with which they were most comfortable. We are all creatures of 
habit, even in our choice of snacks. 

Too Much Choice 

Conventional theory argues that more choice is better. However, this claim 
ignores the costs of making choices. In affluent countries, consumers can 
easily become overwhelmed with choices, making it difficult for them to 
arrive at a decision. 

In one experiment, two marketing researchers set up sampling booths 
for jam in a supermarket. 6 One booth offered 24 flavors and one offered 
only 6. More people stopped at the larger display, but substantially more 
people actually bought jam at the smaller display. More choice seemed to 
be attractive to shoppers, but the profusion of choices in the larger display 
appeared to make it more difficult for the shoppers to reach a decision. 

Two experts in behavioral finance wondered whether the same problem 
with “excessive choice” showed up in investor decisions. They found that 
people who designed their own retirement portfolios tended to be just as 
happy with the average portfolio chosen by their co-workers as they were 
with their own choice. Having the flexibility to construct their own retire¬ 
ment portfolios didn’t seem to make investors feel better off. 7 

Constructed Preferences 

How are we to interpret these examples? Psychologists and behavioral 
economists argue that preferences are not a guide to choice; rather, prefer¬ 
ences are “discovered” in part through the experiences of choice. 

5 I. Simonson, 1990, “The effect of purchase quantity and timing on variety-seeking 
behavior,” Journal of Marketing Research, 17: 150—164. 

6 Sheena S. Iyengar and Mark R. Lepper, “When choice is demotivating: can one desire 
too much of a good thing?” Journal of Personality and Social Psychology, 2000. 

7 Shlomo Benartzi and Richard Thaler, “How Much Is Investor Autonomy Worth?” 
UCLA working paper, 2001. 
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Imagine watching someone in the supermarket picking up a tomato, 
putting it clown, then picking it up again. Do they want it or not? Is 
the price-quality combination offered acceptable? When you watch such 
behavior, you are seeing someone who is “on the margin” in terms of mak¬ 
ing the choice. They are, in the psychologists’ interpretation, discovering 
their preferences. 

Conventional theory treats preferences as preexisting. In this view, pref¬ 
erences explain behavior. Psychologists instead think of preferences as 
being constructed—people develop or create preferences through the act of 
choosing and consuming. 

It seems likely that the psychological model is a better description of what 
actually happens. However, the two viewpoints are not entirely incompat¬ 
ible. As we have seen, once preferences have been discovered, albeit by 
some mysterious process, they tend to become built-in to choices. Choices, 
once made, tend to anchor decisions. If you tried to buy that tomato from 
that consumer once they have finally decided to choose it, you would likely 
have to pay more than it cost them. 


30.2 Uncertainty 

Ordinary choice is complicated enough, but choice under uncertainty tends 
to be particularly tricky. We’ve already seen that people’s decisions may 
depend on how choice alternatives are phrased. But there are many other 
biases in behavior in this domain. 


Law of Small Numbers 

If you have taken a course in statistics, you might be familiar with the Law 
of Large Numbers. This is a mathematical principle that says (roughly) 
that the average of a large sample from a population tends to be close to 
the mean of that population. 

The Law of Small Numbers is a psychological statement that says that 
people tend to be overly influenced by small samples, particularly if they 
experience them themselves. 8 
Consider the following question: 9 


8 The term originated with A. Tversky and D. Kahneman, 1971, “Belief in the law of 
small numbers,” Psychological Bulletin, 76, 2: 105-110. Much of the following discus¬ 
sion is based on a working paper by Matthew Rabin of the University of California 
at Berkeley entitled “Inference by Believers in the Law of Small Numbers.” 

9 A. Tversky and D. Kahneman, 1982, “Judgments of and by Representativeness,” in 
Judgment under Uncertainty: Heuristics and Biases, D. Kahneman, P. Slovic, and 
A. Tversky, Cambridge University Press, 84-98. 
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“A certain town is served by two hospitals. In the larger hospital about 
45 babies are born each day, and in the smaller hospital about 15 babies 
are born each day. As you know, about 50 percent of all babies are boys. 
However, the exact percentage varies from day to day. Sometimes it may 
be higher than 50 percent, sometimes lower. For a period of 1 year, each 
hospital recorded the days on which more than 60 percent of the babies 
born were boys. Which hospital do you think recorded more such days?” 

In a survey of college students, 22 percent of the subjects said that they 
thought that it was more likely that the larger hospital recorded more such 
days, while 56 percent said that they thought the number of days would be 
about the same. Only 22 percent correctly said that the smaller hospital 
would report more days. 

If the correct account seems peculiar to you, suppose the smaller hospi¬ 
tal recorded 2 births per day and the larger hospital 100 births per day. 
Roughly 25 percent of the time the smaller hospital would have 100 percent 
male births, while this would be very rare for the large hospital. 

It appears that people expect samples to look like the distribution from 
which they are drawn. Or, saying this another way, people underestimate 
the actual magnitude of the fluctuations in a sample. 

A related issue is that people find it difficult to recognize randomness. In 
one experiment, subjects were asked to write down a series of 150 “random” 
coin tosses. About 15 percent of the sequences they produced had heads 
or tails three times in a row, but this pattern would occur randomly about 
25 percent of the time. Only 3 percent of the subjects’ sequences had 4 
heads or 4 tails in a row, while probability theory says that this should 
occur about 12 percent of the time. 

This has important implications for game theory, for example. We saw 
that in many cases people should try to randomize their strategy choices 
so as to keep their opponents guessing. But, as the psychological literature 
shows, people aren’t very good at randomizing. On the other hand, people 
aren’t very good at detecting non-random behavior either, at least without 
some training in statistics. The point of mixed strategy equilibria is not 
that choices are mathematically unpredictable, but rather that they should 
be unpredictable by the players in the game. 

Some economic researchers studied final and semi-final tennis matches at 
Wimbledon. 10 Ideally, tennis players should switch their serves from side 
to side so that their opponent can’t guess which side the serve is coming 
from. However, even very accomplished players can’t do this quite as well 
as one might expect. According to the authors: 

“Our tests indicate that the tennis players are not quite playing ran- 


10 M. Walker and J. Wooders, 1999, “Minimax Play at Wimbledon,” University of Ari¬ 
zona working paper. 
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domly: they switch their serves from left to right and vice versa somewhat 
too often to be consistent with random play. This is consistent with ex¬ 
tensive experimental research in psychology and economics which indicates 
that people who are attempting to behave truly randomly tend to “switch 
too often.” 


Asset Integration and Loss Aversion 

In our study of expected utility we made an implicit assumption that what 
individuals cared about was the total amount of wealth that they ended 
up with in various outcomes. This is known as the asset integration 
hypothesis. 

Even though most people would accept this as a reasonable thing to do, 
it is hard to put into practice (even for economists). In general, people 
tend to avoid too many small risks and accept too many large risks. 

Suppose that you make $100,000 a year and that you are offered a coin 
flip. If heads comes up you get $14 and if tails comes up you lose $10. This 
bet has an expected value of $12 and has a minuscule effect on your total 
income in a given year. Unless you have moral scruples about gambling, 
this would be a very attractive bet and you should almost certainly take 
it. However, a surprisingly large number of people won’t take such a bet. 

This excess risk aversion shows up in insurance markets where people 
tend to over-insure themselves against various small events. For example, 
people buy insurance against loosing their cell phone, even though they 
can often replace it at quite a low cost. People also buy auto insurance 
with deductibles that are much too low to make economic sense. 

In general, when making insurance decisions you should look at the 
“house odds.” If cell phone insurance costs you $3 a month, or $36 a 
year, and a new cell phone costs $180, then the house odds are 36/180, 
or 20 percent. The cell phone insurance would pay off in expected value 
only if you have more than a 20 percent chance of losing your phone or if 
it would be an extreme financial hardship to replace it. 

It appears that people aren’t really risk averse as much as they are 
loss averse. That is, people put seemingly excessive weight on the status 
quo—where they start—as opposed to where they end up. 

In an experiment that has been replicated many times, two researchers 
gave half of the subjects in a group coffee mugs. 11 They asked this group to 
report the lowest price at which they would sell the mugs. Then they asked 
the group that didn’t have mugs the highest price at which they would buy 
a mug. Since the groups were chosen randomly, the buying and selling 
prices should be about equal. However, in the experiment, the median 


li 


D. Kahneman, J. L. Kitsch, and R. Thaler, 1990, “Experimental tests of the endow¬ 
ment effect and the Coase theorem,” Journal of Political Economy, 98, 1325-1348. 
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selling price was $5.79 and the median buying price was $2.25, a substantial 
difference. Apparently, the subjects with coffee mugs were more reluctant 
to part with them than subjects without mugs. Their preferences seemed to 
be influenced by their endowment, contrary to standard consumer theory. 

A similar effect shows up in what is known as the sunk cost fallacy. 
Once you have bought something, the amount you paid is “sunk,” or no 
longer recoverable. So future behavior should not be influenced by sunk 
costs. 

But, alas, real people tend to care about how much they paid for some¬ 
thing. Researchers have found that the price at which owners listed con¬ 
dominiums in Boston was highly correlated with the buying price. 12 As 
pointed out earlier, owners of stock are very reluctant to realize losses, even 
when it would be advantageous for tax reasons. 

The fact that ordinary people are subject to the sunk cost fallacy is in¬ 
teresting, but perhaps it is even more interesting that professionals are less 
susceptible to this problem. For example, the authors of the condominium 
example mentioned above found that individuals who bought condos for 
investment purposes were less likely to be influenced by sunk costs than 
individuals who lived in the condos. 

Similarly, financial advisers are seldom reluctant to realize losses, partic¬ 
ularly when there is a tax advantage to do so. It appears that one reason 
to hire professional advisers is to draw on their dispassionate analysis of 
decisions. 


30.3 Time 

Just as behavior involving uncertainty is subject to various forms of anoma¬ 
lous behavior, behavior involving time has its own set of anomalies. 


Discounting 

Consider, for example, time discounting. A standard model in economics, 
exponential discounting, posits that people discount the future at a 
constant fraction. If u(c) is the utility of consumption today, then the 
utility of consumption t years in the future looks like J*u(c), where S < 1. 

This is a mathematically convenient specification, but there are other 
forms of discounting that seem to fit the data better. 

One economist auctioned off bonds that paid off at various times in the 
future and found that people valued payment at future times less than the 


12 


David Genesove and Christopher Mayer, 2001, “Loss aversion and seller behavior: 
Evidence from the housing market,” Quarterly Journal of Economics, 116, 4, 1233- 
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exponential discounting theory would predict. An alternative theory, called 
hyperbolic discounting, suggests that the discount factor does not take 
the form 8 t but rather takes the form 1/(1 + kt). 

One particularly attractive feature of exponential discounting is that 
behavior is “time consistent.” Think about a person with a three-period 
planning horizon with utility function of the form 

■u(ci) + Su(c 2 ) + 5 2 u(c3). 


The marginal rate of substitution between periods 1 and 2 is 


MRS 12 = 


SMU{c 2 ) 
MU{c i) ’ 


while the MRS between periods 2 and 3 is 


MRS 23 = 


8 2 MU(c 3 ) 

8MU(c 2 ) 


SMU(c 3 ) 
MU(c 2 ) ‘ 


This last expression shows that the rate at which the individual is will¬ 
ing to substitute consumption in period 2 for consumption in period 3 is 
the same whether viewed from the perspective of period 1 or of period 2. 
This is not true for hyperbolic discounting. An individual with hyperbolic 
discounting discounts the long-term future more heavily than he discounts 
the short-term future. 

Such a person will exhibit time inconsistency: he may make a plan 
today about his future behavior, but when the future arrives he will want 
to do something different. Think of a couple who decide to spend $5,000 
on a trip to Europe rather than save their money. They rationalize their 
decision on the grounds that they will start saving next summer. But when 
next summer arrives, they decide to spend their money on a cruise. 


Self-control 

A closely related issue to the time consistency problem is the problem of 
self-control. Almost everyone faces this issue to some degree. We might 
vow to count our calories and eat less while standing on the bathroom 
scale, but our resolve can easily vanish when we sit down to a nice meal. 
Rational people are apparently slim and healthy, unlike the rest of us. 

One important question is whether people are aware of their own diffi¬ 
culties with self-control. If I know that I have a tendency to procrastinate, 
perhaps I should realize that when an important task comes along I should 
do it right away. Or if I have a tendency to overcommit myself, perhaps I 
should learn to say no more often. 
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But there is the other possibility. If I know that I am likely to yield 
to the temptation to have another desert tomorrow, I may as well have 
another desert today. The flesh is weak, but the spirit may be weak too. 

One way to deal with self-control is to find ways to commit yourself to 
future actions. That is, you can try to find a way to make it more costly 
to deviate from the desired action in the future. For example, people 
who make a public pronouncement about their future behavior might be 
less likely to deviate from their intended behavior. There are pills for 
alcoholics that make them violently sick if they drink alcohol. There are 
also commitment devices for dieters: someone who has his stomach 
stapled will be less likely to overeat. 

Contracts between individuals are there to ensure that people carry out 
their future intentions—even when it might not be attractive for them to 
do so due to changed conditions. In a similar way, people can hire others 
to impose costs on them if they deviate from intended actions, making, in 
effect, a contract with themselves. Dieting spas, exercise instructors, and 
tutors are forms of “purchased self-control.” 


EXAMPLE: Overconfidence 

An interesting variation on self-control is the phenomenon of overconfi¬ 
dence. Two financial economists, Brad Barber and Terrance Odean, stud¬ 
ied the performance of 66,465 households with discount brokerage accounts. 
During the period they studied, households that traded infrequently re¬ 
ceived an 18 percent return on their investments, while the return for the 
households that traded most actively was 11.3 percent. 

One of the most important factors that apparently influenced this exces¬ 
sive trading was gender: the men traded a lot more than women. Psychol¬ 
ogists commonly find that men tend to have excessive confidence in their 
own abilities, while women, for the most part, tend to be more realistic. 
Psychologists refer to men’s behavior as self-serving attribution bias. Basi¬ 
cally, men (or at least some men) tend to think their successes are a result 
of their own skill, rather than dumb luck, and so become overconfident. 

This overconfidence can have financial repercussions. In the sample of 
brokerage accounts, men traded 45 percent more than women. This exces¬ 
sive trading resulted in the average return to men that was a full percentage 
point lower than the return to women. As Barber and Odean put it, “trad¬ 
ing can be hazardous to your wealth.” 


30.4 Strategic Interaction and Social Norms 

A particularly interesting set of psychological, or perhaps sociological, be¬ 
haviors arise in strategic interaction. We have studied game theory, which 
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attempts to predict how rational players should interact. But there is also 
a subject known as behavioral game theory that examines how actual 
people interact. Indeed, there are systematic and strong deviations from 
the pure theory. 


Ultimatum Came 

Consider the ultimatum game, which was discussed briefly in the last 
chapter. As you will recall, this is a game with two players, the proposer 
and the responder. The proposer is given $10 and asked to propose a 
division between himself and the responder. The responder is then shown 
the division and asked whether or not he wishes to accept it. If he accepts, 
the division is carried out; if he refuses the division, both people walk away 
with nothing. 

Let’s first think about how fully rational players might act. Once the 
responder sees the division, he has a dominant strategy: accept the money 
as long as he gets anything at all. After all, suppose I offer you the choice 
between 10 cents and nothing. Wouldn’t you rather have 10 cents than 
nothing at all? 

Given that a rational responder will choose any amount, the divider 
should choose the minimal amount to give him—say, a penny. So the 
outcome predicted by game theory is an extreme split: the divider will end 
up with almost everything. 

This isn’t the way things turn out when the game is actually played. In 
fact, responders tend to reject offers that they perceive as unfair. Offers 
that give the responder less than 30 percent of the amount to be divided 
are rejected more than 50 percent of the time. 

Of course, if the divider recognizes that the responder will reject “unfair” 
offers, the divider will rationally want to make a division that is closer to 
equal. The average division tends to be about 45 percent to the responder 
and 55 percent to the divider, with about 16 percent of the offers being 
rejected. 

There has been a considerable amount of literature examining how the 
characteristics of the players affect the outcome of the game. One example 
is gender differences: it appears that men tend to receive more favorable 
divisions, particularly when the divisions are made by women. 

Cultural differences can also be important. It appears that some cul¬ 
tures value fairness more than others, inducing people to reject offers that 
are perceived as unfair. 13 Interestingly enough, the offered amounts don’t 
vary much from region to region and culture to culture, while there are 


13 See Swee-Hoon Chuah, Robert Hoffman, Martin Jones, and Geoffrey Williams, “Do 
Cultures Clash? Evidence from Cross-National Ultimatum Game Experiments,” Not¬ 
tingham University Business School working paper. 
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systematic differences in the divisions that are acceptable. The size of the 
pie is also important. If the size of the pie is $10, you might be reluctant 
to accept $1. But if the size of the pie is $1,000, would you be willing to 
reject $100? Apparently, responders do find it difficult to turn down larger 
amounts of money. 

Another variation is in the design of the game. In one variant, the so- 
called strategy method, the responders are asked to name the minimal 
division that they will accept before seeing the amount they are offered. The 
proposers are aware that the decision will be made in advance but, of course, 
don’t know what the minimum acceptable division is. This experimental 
design tends to increase the amounts that the proposers offer; that is, it 
tends to make the divisions more equal. 


Fairness 

One effect at work in the ultimatum game seems to be a concern for fairness. 
Most people seem to have a natural bias towards equal (or at least not too 
unequal) division. This is not simply an individual phenomenon, but a 
social phenomenon. People will enforce fairness norms even when it is 
not directly in their interest to do so. 

Consider, for example, punishment games, which are a generalization 
of ultimatum games with a third party who observes the choices made by 
the proposer/divider. The third party can choose, at some cost to himself, 
to deduct some of the proposer’s profits. 14 

Experimenters have found that around 60 percent of these third-party 
observers will actually punish those who make unfair divisions. There seems 
to be something in the human makeup—whether innate or learned—that 
finds unfair behavior objectionable. 

Indeed, there are differences across cultures with respect to social norms 
for fairness; individuals in some societies seem to value it highly, while in 
other societies fairness is less strongly valued. However, the urge to punish 
those who are unfair is widely felt. It has been suggested that a predilec¬ 
tion towards “fair” outcomes is part of human nature, perhaps because 
individuals that behaved fairly towards each other had higher chances of 
surviving and reproducing. 


30.5 Assessment of Behavioral Economics 

Psychologists, marketers, and behavioral economists have amassed a vari¬ 
ety of examples showing how the basic theory of economic choice is wrong, 
or, at least, incomplete. 


14 


See Ernst Fehr and Urs Fischbacher, 2004, “Third-party punishment and social 
norms,” Evolution and Human Behavior , 25, 63-87. 
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Some of these examples appear to be “optical illusions.” For example, 
the fact that framing a choice problem differently can affect decisions is 
similar to the fact that human judgment of sizes and distances can be 
affected by how figures are drawn. If people took the time to consider the 
choices carefully—applying a measuring stick of dispassionate reasoning— 
they would reach the right conclusion. 

Though it is undoubtedly true that people don’t behave completely in 
accord with the simplest theories of economic behavior, one still might 
respond that no theory is 100 percent correct. Psychologists have also doc¬ 
umented that people don’t really understand simple principles of physics. 
Example: If you tie a weight to the end of a rope, swing it around your 
head in a circle and then let go, which way will the weight fly? 

Many people say that the weight will fly radially outward rather than 
the correct response that the weight will move tangentially to the circle. 15 
Of course, people have lived in the physical world their entire lives. If they 
occasionally misunderstand how it works, we shouldn’t be too surprised 
when people misunderstand the economic world. 

Apparently our intuitive understanding of physics is good enough for 
everyday life, and even the demands of amateur and professional sports: a 
baseball player may not be able to describe how a ball will travel, despite 
the fact that he can throw it well. Similarly, one might argue that people 
tend to be pretty good at the sorts of day-to-day decisions they are forced 
to make, even if they aren’t very good at abstract reasoning about them. 

Another reaction to behavioral anomalies is that markets tend to reward 
rational behavior, while punishing irrationality. Even if many participants 
do not behave rationally, those who do behave sensibly will have the biggest 
effect on prices and outcomes. There is likely some truth to this view 
as well. Recall the example that real estate investors seemed to be less 
influenced by sunk costs than ordinary individuals. 

In addition, you can hire experts to help you make better decisions. Diet 
consultants and financial advisers can offer objective advice about how to 
eat and how to invest. If you are worried about being too fair, you can 
always hire a tough negotiator. 

Returning to the optical illusion example, the reason that we use rulers 
and yardsticks is that we learn not to trust our own eyes. Similarly, in 
making important decisions it is prudent to consult the views of objective 
experts. 


Summary 

1. Behavioral economics is concerned with how consumers make choices in 
reality. 


15 


See M. McCloskey, 1983, “Intuitive Physics,” Scientific American, April, 114-123. 
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2. In many cases, actual consumer behavior is different from that predicted 
by the simple model of the rational consumer. 

3. Consumers make different choices depending on how a problem is framed 
or presented. 

4. Choice behavior can be particularly problematic in choices involving 
uncertainty. 

5. Consumers seem to have a preference for “fair” divisions and will punish 
those who behave unfairly. 
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REVIEW QUESTIONS 


1. Subjects are allowed to buy tickets in a lottery. One group is told that 
they have a 55 percent chance of winning, the other group is told that they 
have a 45 percent chance of not winning. Which group is more likely to 
buy lottery tickets? What is the name for this effect? 

2. Mary plans the entire week’s meals for her family, while Fred shops each 
day. Which is likely to produce more varied meals? What is this effect 
called? 

3. You are the human resources director for a medium-size company and 
are trying to decide how many mutual funds to offer in your employees’ 
pension plan. Would it be better to offer 10 choices or 50 choices? 

4. What is the probability that a fair coin will come up heads three times 
in a row when tossed? 

5. John decides that he will save $5 this week and $10 next week. But 
when next week arrives, he decides to save only $8. What is the term used 
to describe this sort of inconsistent behavior? 


CHAPTER 



EXCHANGE 


Up until now we have generally considered the market for a single good in 
isolation. We have viewed the demand and supply functions for a good as 
depending on its price alone, disregarding the prices of other goods. But in 
general the prices of other goods will affect people’s demands and supplies 
for a particular good. Certainly the prices of substitutes and complements 
for a good will influence the demand for it, and, more subtly, the prices 
of goods that people sell will affect the amount of income they have and 
thereby influence how much of other goods they will be able to buy. 

Up until now we have been ignoring the effect of these other prices on 
the market equilibrium. When we discussed the equilibrium conditions in a 
particular market, we only looked at part of the problem: how demand and 
supply were affected by the price of the particular good we were examining. 
This is called partial equilibrium analysis. 

In this chapter we will begin our study of general equilibrium analysis: 
how demand and supply conditions interact in several markets to determine 
the prices of many goods. As you might suspect, this is a complex problem, 
and we will have to adopt several simplifications in order to deal with it. 

First, we will limit our discussion to the behavior of competitive markets, 
so that each consumer or producer will take prices as given and optimize 
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accordingly. The study of general equilibrium with imperfect competition 
is very interesting but too difficult to examine at this point. 

Second, we will adopt our usual simplifying assumption of looking at the 
smallest number of goods and consumers that we possibly can. In this case, 
it turns out that many interesting phenomena can be depicted using only 
two goods and two consumers. All of the aspects of general equilibrium 
analysis that we will discuss can be generalized to arbitrary numbers of 
consumers and goods, but the exposition is simpler with two of each. 

Third, we will look at the general equilibrium problem in two stages. 
We will start with an economy where people have fixed endowments of 
goods and examine how they might trade these goods among themselves; 
no production will be involved. This case is naturally known as the case 
of pure exchange. Once we have a clear understanding of pure exchange 
markets we will examine production behavior in the general equilibrium 
model. 


31.1 The Edgeworth Box 

There is a convenient graphical tool known as the Edgeworth box that 
can be used to analyze the exchange of two goods between two people. 1 The 
Edgeworth box allows us to depict the endowments and preferences of two 
individuals in one convenient diagram, which can be used to study various 
outcomes of the trading process. In order to understand the construction 
of an Edgeworth box it is necessary to examine the indifference curves and 
the endowments of the people involved. 

Let us call the two people involved A and B and the two goods involved 
1 and 2. We will denote A’s consumption bundle by Xa = {x\, x\), where 
x\ represents A’s consumption of good 1 and x\ represents A’s consump¬ 
tion of good 2. Then B’s consumption bundle is denoted by Xb = (xg,x^). 
A pair of consumption bundles, Xa and Xb , is called an allocation. An 
allocation is a feasible allocation if the total amount of each good con¬ 
sumed is equal to the total amount available: 

x A + 

X A + *5 = U>a + 

A particular feasible allocation that is of interest is the initial endow¬ 
ment allocation, {u) 1 a ,uj 2 a ) and This is the allocation that the 

consumers start with. It consists of the amount of each good that con¬ 
sumers bring to the market. They will exchange some of these goods with 
each other in the course of trade to end up at a final allocation. 

1 The Edgeworth box is named in honor of Francis Ysidro Edgeworth (1845—1926), an 

English economist who was one of the first to use this analytical tool. 
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Figure 

31.1 


The Edgeworth box shown in Figure 31.1 can be used to illustrate these 
concepts graphically. We first use a standard consumer theory diagram 
to illustrate the endowment and preferences of consumer A. We can also 
mark off on these axes the total amount of each good in the economy— 
the amount that A has plus the amount that B has of each good. Since 
we will only be interested in feasible allocations of goods between the two 
consumers, we can draw a box that contains the set of possible bundles of 
the two goods that A can hold. 


xj -<- -Person 



Ail Edgeworth box. The width of the box measures the 
total amount of good 1 in the economy and the height measures 
the total amount of good 2. Person A’s consumption choices 
are measured from the lower left-hand corner while person B’s 
choices are measured from the upper right. 


Note that the bundles in this box also indicate the amount of the goods 
that B can hold. If there are 10 units of good 1 and 20 units of good 2, 
then if A holds (7,12), B must be holding (3,8). We can depict how much 
A holds of good 1 by the distance along the horizontal axis from the origin 
in the lower left-hand corner of the box and the amount B holds of good 1 
by measuring the distance along the horizontal axis from the upper right- 
hand corner. Similarly, distances along the vertical axes give the amounts 
of good 2 that A and B hold. Thus the points in this box give us both the 
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bundles that A can hold and the bundles that B can hold—just measured 
from different origins. The points in the Edgeworth box can represent all 
feasible allocations in this simple economy. 

We can depict A’s indifference curves in the usual manner, but B’s indif¬ 
ference curves take a somewhat different form. To construct them we take 
a standard diagram for B’s indifference curves, turn it upside down, and 
“overlay” it on the Edgeworth box. This gives us B’s indifference curves 
on the diagram. If we start at A’s origin in the lower left-hand corner and 
move up and to the right, we will be moving to allocations that are more 
preferred by A. As we move down and to the left we will be moving to 
allocations that are more preferred by B. (If you rotate your book and look 
at the diagram, this discussion may seem clearer.) 

The Edgeworth box allows us to depict the possible consumption bundles 
for both consumers—the feasible allocations—and the preferences of both 
consumers. It thereby gives a complete description of the economically 
relevant characteristics of the two consumers. 


31.2 Trade 

Now that we have both sets of preferences and endowments depicted we 
can begin to analyze the question of how trade takes place. We start 
at the original endowment of goods, denoted by the point W in Figure 
31.1. Consider the indifference curves of A and B that pass through this 
allocation. The region where A is better off than at her endowment consists 
of all the bundles above her indifference curve through W. The region 
where B is better off than at his endowment consists of all the allocations 
that are above—from his point of view —his indifference curve through W. 
(This is below his indifference curve from our point of view ... unless you’ve 
still got your book upside down.) 

Where is the region of the box where A and B are both made better 
off? Clearly it is in the intersection of these two regions. This is the lens¬ 
shaped region illustrated in Figure 31.1. Presumably in the course of their 
negotiations the two people involved will find some mutually advantageous 
trade—some trade that will move them to some point inside the lens-shaped 
area such as the point M in Figure 31.1. 

The particular movement to M depicted in Figure 31.1 involves person 
A giving up \x A — u>\\ units of good 1 and acquiring in exchange \x 2 A — lo\\ 
units of good 2. This means that B acquires \x x B — oj b \ units of good 1 and 
gives up \x B — uj b \ units of good 2. 

There is nothing particularly special about the allocation M. Any allo¬ 
cation inside the lens-shaped region would be possible—for every allocation 
of goods in this region is an allocation that makes each consumer better off 
than he or she was at the original endowment. We only need to suppose 
that the consumers trade to some point in this region. 
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Figure 

31.2 


Now we can repeat the same analysis at the point M. We can draw the 
two indifference curves through M, construct a new lens-shaped “region of 
mutual advantage,” and imagine the traders moving to some new point N 
in this region. And so it goes ... the trade will continue until there are no 
more trades that are preferred by both parties. What does such a position 
look like? 


31.3 Pareto Efficient Allocations 

The answer is given in Figure 31.2. At the point M in this diagram the set of 
points above A’s indifference curve doesn’t intersect the set of points above 
B’s indifference curve. The region where A is made better off is disjoint 
from the region where B is made better off. This means that any movement 
that makes one of the parties better off necessarily makes the other party 
worse off. Thus there are no exchanges that are advantageous for both 
parties. There are no mutually improving trades at such an allocation. 

An allocation such as this is known as a Pareto efficient allocation. 
The idea of Pareto efficiency is a very important concept in economics that 
arises in various guises. 



A Pareto efficient allocation. At a Pareto efficient alloca¬ 
tion such as M, each person is on his highest possible indiffer¬ 
ence curve, given the indifference curve of the other person. The 
line connecting such points is known as the contract curve. 
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A Pareto efficient allocation can be described as an allocation where: 

1. There is no way to make all the people involved better off; or 

2. there is no way to make some individual better off without making 
someone else worse off; or 

3. all of the gains from trade have been exhausted; or 

4. there are no mutually advantageous trades to be made, and so on. 

Indeed we have mentioned the concept of Pareto efficiency several times 
already in the context of a single market: we spoke of the Pareto efficient 
level of output in a single market as being that amount of output where 
the marginal willingness to buy equaled the marginal willingness to sell. 
At any level of output where these two numbers differed, there would be a 
way to make both sides of the market better off by carrying out a trade. 
In this chapter we will examine more deeply the idea of Pareto efficiency 
involving many goods and many traders. 

Note the following simple geometry of Pareto efficient allocations: the 
indifference curves of the two agents must be tangent at any Pareto efficient 
allocation in the interior of the box. It is easy to see why. If the two 
indifference curves are not tangent at an allocation in the interior of the box, 
then they must cross. But if they cross, then there must be some mutually 
advantageous trade—so that point cannot be Pareto efficient. (It is possible 
to have Pareto efficient allocations on the sides of the box—where one 
consumer has zero consumption of some good—in which the indifference 
curves are not tangent. These boundary cases are not important for the 
current discussion.) 

From the tangency condition it is easy to see that there are a lot of Pareto 
efficient allocations in the Edgeworth box. In fact, given any indifference 
curve for person A, for example, there is an easy way to find a Pareto 
efficient allocation. Simply move along A’s indifference curve until you 
find the point that is the best point for B. This will be a Pareto efficient 
point, and thus both indifference curves must be tangent at this point. 

The set of all Pareto efficient points in the Edgeworth box is known as 
the Pareto set, or the contract curve. The latter name comes from 
the idea that all “final contracts” for trade must lie on the Pareto set— 
otherwise they wouldn’t be final because there would be some improvement 
that could be made! 

In a typical case the contract curve will stretch from A’s origin to B’s 
origin across the Edgeworth box, as shown in Figure 31.2. If we start at A’s 
origin, A has none of either good and B holds everything. This is Pareto 
efficient since the only way A can be made better off is to take something 
away from B. As we move up the contract curve A is getting more and 
more well-off until we finally get to B’s origin. 
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The Pareto set describes all the possible outcomes of mutually advanta¬ 
geous trade from starting anywhere in the box. If we are given the starting 
point—the initial endowments for each consumer—we can look at the sub¬ 
set of the Pareto set that each consumer prefers to his initial endowment. 
This is simply the subset of the Pareto set that lies in the lens-shaped re¬ 
gion depicted in Figure 31.1. The allocations in this lens-shaped region are 
the possible outcomes of mutual trade starting from the particular initial 
endowment depicted in that diagram. But the Pareto set itself doesn’t 
depend on the initial endowment, except insofar as the endowment de¬ 
termines the total amounts of both goods that are available and thereby 
determines the dimensions of the box. 


31.4 Market Trade 

The equilibrium of the trading process described above—the set of Pareto 
efficient allocations—is very important, but it still leaves a lot of ambiguity 
about where the agents end up. The reason is that the trading process we 
have described is very general. Essentially we have only assumed that the 
two parties will move to some allocation where they are both made better 
off. 

If we have a particular trading process, we will have a more precise 
description of equilibrium. Let’s try to describe a trading process that 
mimics the outcome of a competitive market. 

Suppose that we have a third party who is willing to act as an “auction¬ 
eer” for the two agents A and B. The auctioneer chooses a price for good 1 
and a price for good 2 and presents these prices to the agents A and B. 
Each agent then sees how much his or her endowment is worth at the prices 
(pi, ) and decides how much of each good he or she would want to buy 

at those prices. 

One warning is in order here. If there are really only two people involved 
in the transaction, then it doesn’t make much sense for them to behave in 
a competitive manner. Instead they would probably attempt to bargain 
over the terms of trade. One way around this difficulty is to think of the 
Edgeworth box as depicting the average demands in an economy with only 
two types of consumers, but with many consumers of each type. Another 
way to deal with this is to point out that the behavior is implausible in 
the two-person case, but it makes perfect sense in the many-person case, 
which is what we are really concerned with. 

Either way, we know how to analyze the consumer-choice problem in this 
framework -it is just the standard consumer-choice problem we described 
in Chapter 5. In Figure 31.3 we illustrate the two demanded bundles of 
the two agents. (Note that the situation depicted in Figure 31.3 is not an 
equilibrium configuration since the demand by one agent is not equal to 
the supply of the other agent.) 
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Person 



Gross demands and net demands. Gross demands are the 
amounts the person wants to consume; net demands are the 
amounts the person wants to purchase. 


Figure 

31.3 


As in Chapter 9 there are two relevant concepts of “demand” in this 
framework. The gross demand of agent A for good 1, say, is the total 
amount of good 1 that he wants at the going prices. The net demand 
of agent A for good 1 is the difference between this total demand and 
the initial endowment of good 1 that agent A holds. In the context of 
general equilibrium analysis, net demands are sometimes called excess 
demands. We will denote the excess demand of agent A for good 1 by 
e\. By definition, if A’s gross demand is x \, and his endowment is we 
have 

e A= x A~ W A- 

The concept of excess demand is probably more natural, but the concept 
of gross demand is generally more useful. We will typically use the word 
“demand” to mean gross demand and specifically say “net demand” or 
“excess demand” if that is what we mean. 

For arbitrary prices (pi,P 2 ) there is no guarantee that supply will equal 
demand—in either sense of demand. In terms of net demand, this means 
that the amount that A wants to buy (or sell) will not necessarily equal 
the amount that B wants to sell (or buy). In terms of gross demand, this 
means that the total amount that the two agents want hold of the goods is 
not equal to the total amount of that goods available. Indeed, this is true 
in the example depicted in Figure 31.3. In this example the agents will not 
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be able to complete their desired transactions: the markets will not clear. 

We say that in this case the market is in disequilibrium. In such 
a situation, it is natural to suppose that the auctioneer will change the 
prices of the goods. If there is excess demand for one of the goods, the 
auctioneer will raise the price of that good, and if there is excess supply 
for one of the goods, the auctioneer will lower its price. 

Suppose that this adjustment process continues until the demand for 
each of the goods equals the supply. What will the final configuration look 
like? 

The answer is given in Figure 31.4. Here the amount that A wants to 
buy of good 1 just equals the amount that B wants to sell of good 1, and 
similarly for good 2. Said another way, the total amount that each person 
wants to buy of each good at the current prices is equal to the total amount 
available. We say that the market is in equilibrium. More precisely, 
this is called a market equilibrium, a competitive equilibrium, or a 
Walrasian equilibrium. 2 Each of these terms refers to the same thing: a 
set of prices such that each consumer is choosing his or her most-preferred 
affordable bundle, and all consumers’ choices are compatible in the sense 
that demand equals supply in every market. 

We know that if each agent is choosing the best bundle that he can afford, 
then his marginal rate of substitution between the two goods must be equal 
to the ratio of the prices. But if all consumers are facing the same prices, 
then all consumers will have to have the same marginal rate of substitution 
between each of the two goods. In terms of Figure 31.4, an equilibrium has 
the property that each agent’s indifference curve is tangent to his budget 
line. But since each agent’s budget line has the slope —pi/p 2 , this means 
that the two agents’ indifference curves must be tangent to each other. 


31.5 The Algebra of Equilibrium 

If we let x\(pi,p 2 ) be agent A’s demand function for good 1 and Xg(pi,P 2 ) 
be agent B’s demand function for good 1, and define the analogous expres¬ 
sions for good 2, we can describe this equilibrium as a set of prices (p*, p \) 
such that 

x\{p\,pVi +x 1 b (p* 1 ,pZ) =uj^+ojI j 
%a(Pi,P2) + Xb(Pi,P2) =^a + Vb- 

These equations say that in equilibrium the total demand for each good 
should be equal to the total supply. 


2 Leon Walras (1834-1910) was a French economist at Lausanne who was an early 
investigator of general equilibrium theory. 
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x' B -<-I 'erson 



Equilibrium in the Edgeworth box. In equilibrium, each 
person is choosing the most-preferred bundle in his budget set, 
and the choices exhaust the available supply. 


Figure 

31.4 


Another way to describe the equilibrium is to rearrange these two equa¬ 
tions to get 

[xAi.P 1 .P 2 ) ~ + [xb(Pi,P2) - Wb] = 0 

[x\(plp* 2 ) - u\\ + [xl{pl,p* 2 ) - uj 2 b \ = 0 . 

These equations say that the sum of net demands of each agent for each 
good should be zero. Or, in other words, the net amount that A chooses 
to demand (or supply) must be equal to the net amount that B chooses to 
supply (or demand). 

Yet another formulation of these equilibrium equations comes from the 
concept of the aggregate excess demand function. Let us denote the 
net demand function for good 1 by agent A by 

e- l AiPuP 2 ) = x 1 A {pi,p 2 ) - u> A 

and define e B (pi,p 2 ) in a similar manner. 

The function e\(jp\ 1 p 2 ) measures agent A’s net demand or his excess 
demand -the difference between what she wants to consume of good 1 
and what she initially has of good 1. Now let us add together agent A’s 
net demand for good 1 and agent B’s net demand for good 1. We get 

ziiPuPi) = Ca(pi,P 2) + es(pi,p 2 ) 

= x\(pi,p 2 ) + x B (p 1: p 2 ) -uj \4 - U)g, 
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which we call the aggregate excess demand for good 1. There is a 
similar aggregate excess demand for good 2, which we denote by z 2 (pi,p 2 ). 

Then we can describe an equilibrium {p\,p 2 ) by saying that the aggregate 
excess demand for each good is zero: 

z i{P*nP*2) = 0 
MPitPI) = o. 

Actually, this definition is stronger than necessary. It turns out that if 
the aggregate excess demand for good 1 is zero, then the aggregate excess 
demand for good 2 must necessarily be zero. In order to prove this, it 
is convenient to first establish a property of the aggregate excess demand 
function known as Walras’ law. 


31.6 Walras' Law 

Using the notation established above, Walras’ law states that 
piz 1 (pi 1 p 2 ) + P 2 Z 2 (pi,P 2 ) = 0. 

That is, the value of aggregate excess demand is identically zero. To say 
that the value of aggregate demand is identically zero means that it is zero 
for all possible choices of prices, not just equilibrium prices. 

The proof of this follows from adding up the two agents’ budget con¬ 
straints. Consider first agent A. Since her demand for each good satisfies 
her budget constraint, we have 

Pix\{pi,p 2 ) + p 2 x 2 A (pi,p 2 ) = p 1 u\ + p 2 u\ 

or 

pA x \{pi,P2) -Va] +P2[x 2 a(piiP 2 ) - v 2 a] = o 

Pie\(pi,p 2 ) +P2e 2 A {pi,P 2 ) = 0. 

This equation says that the value of agent A’s net demand is zero. That 
is, the value of how much A wants to buy of good 1 plus the value of how 
much she wants to buy of good 2 must equal zero. (Of course the amount 
that she wants to buy of one of the goods must be negative—that is, she 
intends to sell some of one of the goods to buy more of the other.) 

We have a similar equation for agent B: 

Pi[x B (pi,p 2 ) -Wb] + P 2 [x 2 B (pi,p 2 ) ~u) 2 b \ = 0 
Pie B {pi,p 2 ) +P 2 e B {pi,p 2 ) = 0. 

Adding the equations for agent A and agent B together and using the 
definition of aggregate excess demand, Z\(pi,p 2 ) and z 2 (pi,p 2 ), we have 

Pi[e 1 A {pi,P 2 ) + e B {p!,p 2 )\ + p 2 [e 2 A (pi 1 p 2 ) + e|(pi,p 2 )] = 0 

pizi(pi,p 2 ) +p 2 z 2 (pi,p 2 ) = 0. 
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Now we can see where Walras’ law comes from: since the value of each 
agent’s excess demand equals zero, the value of the sum of the agents’ 
excess demands must equal zero. 

We can now demonstrate that if demand equals supply in one market, 
demand must also equal supply in the other market. Note that Walras’ 
law must hold for all prices, since each agent must satisfy his or her budget 
constraint for all prices. Since Walras’ law holds for all prices, in particular, 
it holds for a set of prices where the excess demand for good 1 is zero: 

zi(p* 1 ,p* 2 ) = 0. 

According to Walras’ law it must also be true that 


PlZliPuPz) +P2 z 2(P*l,P2) = 0 . 


It easily follows from these two equations that if p 2 > 0, then we must have 

*2(Pi,pS) = o. 

Thus, as asserted above, if we find a set of prices (pi-P^) where the 
demand for good 1 equals the supply of good 1, we are guaranteed that 
the demand for good 2 must equal the supply of good 2. Alternatively, if 
we find a set of prices where the demand for good 2 equals the supply of 
good 2, we are guaranteed that market 1 will be in equilibrium. 

In general, if there are markets for k goods, then we only need to find 
a set of prices where k — 1 of the markets are in equilibrium. Walras’ law 
then implies that the market for good k will automatically have demand 
equal to supply. 


31.7 Relative Prices 

As we’ve seen above, Walras’ law implies that there are only k — 1 indepen¬ 
dent equations in a fc-good general equilibrium model: if demand equals 
supply in k — 1 markets, demand must equal supply in the final market. 
But if there are k goods, there will be k prices to be determined. How can 
you solve for k prices with only k — 1 equations? 

The answer is that there are really only k — 1 independent prices. We 
saw in Chapter 2 that if we multiplied all prices and income by a positive 
number t, then the budget set wouldn’t change, and thus the demanded 
bundle wouldn’t change either. In the general equilibrium model, each 
consumer’s income is just the value of his or her endowment at the market 
prices. If we multiply all prices by t > 0, we will automatically multiply 
each consumer’s income by t. Thus, if we find some equilibrium set of 
prices {p\,p 2 ), then (tp^tp^) are equilibrium prices as well, for any t > 0. 
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This means that we are free to choose one of the prices and set it equal to 
a constant. In particular it is often convenient to set one of the prices equal 
to 1 so that all of the other prices can be interpreted as being measured 
relative to it. As we saw in Chapter 2, such a price is called a numeraire 
price. If we choose the first price as the numeraire price, then it is just like 
multiplying all prices by the constant t = 1/pi- 

The requirement that demand equal supply in every market can only be 
expected to determine the equilibrium relative prices, since multiplying all 
prices by a positive number will not change anybody’s demand and supply 
behavior. 

EXAMPLE: An Algebraic Example of Equilibrium 

The Cobb-Douglas utility function described in Chapter 6 has the form 
u A (x\, x\) = (x\) a (x\) 1 ~ a for person A, and a similar form for person B. 
We saw there that this utility function gave rise to the following demand 
functions: 


x 1 A (pi,P2,m A ) = a - 



XB(Pi,P2,m B ) = b— 
Pi 



where a and b are the parameters of the two consumers’ utility functions. 

We know that in equilibrium, the money income of each individual is 
given by the value of his or her endowment: 


m A = Piu\ + p 2 u}\ 
m B = PiWb + p 2 to 2 B . 


Thus the aggregate excess demands for the two goods are 



— — Ug 


a Piu A +p 2 u A 

Pi 


b Pl^B + P2^b 

Pi 


— 0J A — U>B 


and 



2 2 
-U) A -U) B . 


P2 


P2 
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You should verify that these aggregate demand functions satisfy Walras’ 
law. 

Let us choose P 2 as the numeraire price, so that these equations become 


, -.X Pl^A + ^A , uPl^B+^B 1 1 

zi(pi, 1) = a ---- + b ---- -u l a -uj l b 


Pi 


Pi 


Zi{PiA) = (l-a)(p 1 u\ + u\) + (l-b)(p 1 w 1 B +u%)-u\-w%. 

All we’ve done here is set P 2 = 1. 

We now have an equation for the excess demand for good 1 , z\(p\,l), 
and an equation for the excess demand for good 2, z^ipi , 1), with each 
equation expressed as a function of the relative price of good 1, p\. In 
order to find the equilibrium price, we set either of these equations equal 
to zero and solve for pi. According to Walras’ law, we should get the same 
equilibrium price, no matter which equation we solve. 

The equilibrium price turns out to be 


Pi = 


au>. 


bui 2 B 


(1 — a)uj\ + (1 — b)w 


(Skeptics may want to insert this value ofpi into the demand equals supply 
equations to verify that the equations are satisfied.) 


31.8 The Existence of Equilibrium 

In the example given above, we had specific equations for each consumer’s 
demand function and we could explicitly solve for the equilibrium prices. 
But in general, we don’t have explicit algebraic formulas for each con¬ 
sumer’s demands. We might well ask how do we know that there is any set 
of prices such that demand equals supply in every market? This is known 
as the question of the existence of a competitive equilibrium. 

The existence of a competitive equilibrium is important insofar as it 
serves as a “consistency check” for the various models that we have ex¬ 
amined in previous chapters. What use would it be to build up elaborate 
theories of the workings of a competitive equilibrium if such an equilibrium 
commonly did not exist? 

Early economists noted that in a market with k goods there were k — 1 
relative prices to be determined, and there were k — 1 equilibrium equations 
stating that demand should equal supply in each market. Since the number 
of equations equaled the number of unknowns, they asserted that there 
would be a solution where all of the equations were satisfied. 

Economists soon discovered that such arguments were fallacious. Merely 
counting the number of equations and unknowns is not sufficient to prove 
that an equilibrium solution will exist. However, there are mathematical 
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tools that can be used to establish the existence of a competitive equilib¬ 
rium. The crucial assumption turns out to be that the aggregate excess 
demand function is a continuous function. This means, roughly speak¬ 
ing, that small changes in prices should result in only small changes in 
aggregate demand: a small change in prices should not result in a big jump 
in the quantity demanded. 

Under what conditions will the aggregate demand functions be continu¬ 
ous? Essentially there are two kinds of conditions that will guarantee con¬ 
tinuity. One is that each individual’s demand function be continuous—that 
small changes in prices will lead to only small changes in demand. This 
turns out to require that each consumer have convex preferences, which 
we discussed in Chapter 3. The other condition is more general. Even 
if consumers themselves have discontinuous demand behavior, as long as 
all consumers are small relative to the size of the market, the aggregate 
demand function will be continuous. 

This latter condition is quite nice. After all, the assumption of competi¬ 
tive behavior only makes sense when there are a lot of consumers who are 
small relative to the size of the market. This is exactly the condition that 
we need in order to get the aggregate demand functions to be continuous. 
And continuity is just the ticket to ensure that a competitive equilibrium 
exists. Thus the very assumptions that make the postulated behavior rea¬ 
sonable will ensure that the equilibrium theory will have content. 


31.9 Equilibrium and Efficiency 

We have now analyzed market trade in a pure exchange model. This gives 
us a specific model of trade that we can compare to the general model of 
trade that we discussed in the beginning of this chapter. One question 
that might arise about the use of a competitive market is whether this 
mechanism can really exhaust all of the gains from trade. After we have 
traded to a competitive equilibrium where demand equals supply in every 
market, will there be any more trades that people will desire to carry out? 
This is just another way to ask whether the market equilibrium is Pareto 
efficient: will the agents desire to make any more trades after they have 
traded at the competitive prices? 

We can see the answer by inspecting Figure 31.4: it turns out that the 
market equilibrium allocation is Pareto efficient. The proof is this: an 
allocation in the Edgeworth box is Pareto efficient if the set of bundles 
that A prefers doesn’t intersect the set of bundles that B prefers. But at 
the market equilibrium, the set of bundles preferred by A must lie above her 
budget set, and the same thing holds for B, where “above” means “above 
from B’s point of view.” Thus the two sets of preferred allocations can’t 
intersect. This means that there are no allocations that both agents prefer 
to the equilibrium allocation, so the equilibrium is Pareto efficient. 


THE ALGEBRA OF EFFICIENCY 597 


31.10 The Algebra of Efficiency 

We can also show this algebraically. Suppose that we have a market equi¬ 
librium that is not Pareto efficient. We will show that this assumption 
leads to a logical contradiction. 

To say that the market equilibrium is not Pareto efficient means that 
there is some other feasible allocation (y\,y%yg,y%) such that 


y\ + Vb ~ 

(31.1) 

y\ + Vb = 

(31.2) 

(Va^Va) A (, x Ai x Al) 

(31.3) 

(Z/Bi2/b) ^ B (^Bj^b). 

(31.4) 


The first two equations say that the y-allocation is feasible, and the next 
two equations say that it is preferred by each agent to the ^-allocation. 
(The symbols and )^b refer to the preferences of agents A and B.) 

But by hypothesis, we have a market equilibrium where each agent is 
purchasing the best bundle he or she can afford. If {y\,ij\) is better than 
the bundle that A is choosing, then it must cost more than A can afford, 
and similarly for B: 


PWa + P2Va > Pi^A + P2 ^a 

PiyB+P2y 2 B > Pl^B + P2V%- 
Now add these two equations together to get 

pi{y\ + vb) +P2(y 2 A + vb) > pi(v\ + 4) + P2{^a + w|). 
Substitute from equations (31.1) and (31.2) to get 

Pi{^A + w b) + P2(p>\ + w s) > Pii^A + w b) + P2{^>a + w b)j 

which is clearly a contradiction, since the left-hand side and the right-hand 
side are the same. 

We derived this contradiction by assuming that the market equilibrium 
was not Pareto efficient. Therefore, this assumption must be wrong. It 
follows that all market equilibria are Pareto efficient: a result known as the 
First Theorem of Welfare Economics. 

The First Welfare Theorem guarantees that a competitive market will 
exhaust all of the gains from trade: an equilibrium allocation achieved by a 
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set of competitive markets will necessarily be Pareto efficient. Such an al¬ 
location may not have any other desirable properties, but it will necessarily 
be efficient. 

In particular, the First Welfare Theorem says nothing about the distri¬ 
bution of economic benefits. The market equilibrium might not be a “just” 
allocation—if person A owned everything to begin with, then she would 
own everything after trade. That would be efficient, but it would probably 
not be very fair. But, after all, efficiency does count for something, and 
it is reassuring to know that a simple market mechanism like the one we 
have described is capable of achieving an efficient allocation. 


EXAMPLE: Monopoly in the Edgeworth Box 

In order to understand the First Welfare Theorem better, it is useful to 
consider another resource allocation mechanism that does not lead to effi¬ 
cient outcomes. A nice example of this occurs when one consumer attempts 
to behave as a monopolist. Suppose now that there is no auctioneer and 
that instead, agent A is going to quote prices to agent B, and agent B will 
decide how much he wants to trade at the quoted prices. Suppose further 
that A knows B’s “demand curve” and will attempt to choose the set of 
prices that makes A as well-off as possible, given the demand behavior of B. 

In order to examine the equilibrium in this process, it is appropriate 
to recall the definition of a consumer’s price offer curve. The price offer 
curve, which we discussed in Chapter 6, represents all of the optimal choices 
of the consumer at different prices. B’s offer curve represents the bundles 
that he will purchase at different prices; that is, it describes B’s demand 
behavior. If we draw a budget line for B, then the point where that budget 
line intersects his offer curve represents B’s optimal consumption. 

Thus, if agent A wants to choose the prices to offer to B that make A as 
well-off as possible, she should find that point on B’s offer curve where A 
has the highest utility. Such a choice is depicted in Figure 31.5. 

This optimal choice will be characterized by a tangency condition as 
usual: A’s indifference curve will be tangent to B’s offer curve. If B’s offer 
curve cut A’s indifference curve, there would be some point on B’s offer 
curve that A preferred—so we couldn’t be at the optimal point for A. 

Once we have identified this point—denoted by X in Figure 31.5—we 
just draw a budget line to that point from the endowment. At the prices 
that generate this budget line, B will choose the bundle X, and A will be 
as well-off as possible. 

Is this allocation Pareto efficient? In general the answer is no. To see this 
simply note that A’s indifference curve will not be tangent to the budget 
line at A, and therefore A’s indifference curve will not be tangent to B’s 
indifference curve. A’s indifference curve is tangent to B’s offer curve, 
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Person 



Monopoly in the Edgeworth box. A chooses the point on 
B’s offer curve that gives him the highest utility. 


but it cannot then be tangent to B’s indifference curve. The monopoly 
allocation is Pareto inefficient. 

In fact, it is Pareto inefficient in exactly the same way as described in 
the discussion of monopoly in Chapter 24. At the margin A would like to 
sell more at the equilibrium prices, but she can only do so by lowering the 
price at which she sells—and this will lower her income received from all 
her inframarginal sales. 

We saw in Chapter 25 that a perfectly discriminating monopolist would 
end up producing an efficient level of output. Recall that a discriminating 
monopolist was one who was able to sell each unit of a good to the person 
who was willing to pay the most for that unit. What does a perfectly 
discriminating monopolist look like in the Edgeworth box? 

The answer is depicted in Figure 31.6. Let us start at the initial en¬ 
dowment, W, and imagine A selling each unit of good 1 to B at a different 
price—the price at which B is just indifferent between buying or not buying 
that unit of the good. Thus, after A sells the first unit, B will remain on 
the same indifference curve through W. Then A sells the second unit of 
good 1 to B for the maximum price he is willing to pay. This means that 
the allocation moves further to the left, but remains on B’s indifference 
curve through W. Agent A continues to sell units to B in this manner, 
thereby moving up B’s indifference curve to find her- A’s—most preferred 
point, denoted by an X in Figure 31.6. 


Figure 

31.5 
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Figure 

31.6 



Person 

B 


A perfectly discriminating monopolist. Person A chooses 
the point X on person B’s indifference curve through the en¬ 
dowment that gives him the highest utility. Such a point must 
be Pareto efficient. 


It is easy to see that such a point must be Pareto efficient. Agent A will 
be as well-off as possible given B’s indifference curve. At such a point, A 
has managed to extract all of B’s consumer’s surplus: B is no better off 
than he was at his endowment. 

These two examples provide useful benchmarks with which to think 
about the First Welfare Theorem. The ordinary monopolist gives an exam¬ 
ple of a resource allocation mechanism that results in inefficient equilibria, 
and the discriminating monopolist gives another example of a mechanism 
that results in efficient equilibria. 


31.11 Efficiency and Equilibrium 

The First Welfare Theorem says that the equilibrium in a set of competitive 
markets is Pareto efficient. What about the other way around? Given 
a Pareto efficient allocation, can we find prices such that it is a market 
equilibrium? It turns out that the answer is yes, under certain conditions. 
The argument is illustrated in Figure 31.7. 

Let us pick a Pareto efficient allocation. Then we know that the set of 
allocations that A prefers to her current assignment is disjoint from the set 
that B prefers. This implies of course that the two indifference curves are 
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Person 



The Second Theorem of Welfare Economics. When pref¬ 
erences are convex, a Pareto efficient allocation is an equilibrium 
for some set of prices. 


Figure 

31.7 


tangent at the Pareto efficient allocation. So let us draw in the straight 
line that is their common tangent, as in Figure 31.7. 

Suppose that the straight line represents the agents’ budget sets. Then 
if each agent chooses the best bundle on his or her budget set, the resulting 
equilibrium will be the original Pareto efficient allocation. 

Thus the fact that the original allocation is efficient automatically de¬ 
termines the equilibrium prices. The endowments can be any bundles that 
give rise to the appropriate budget set—that is, bundles that lie somewhere 
on the constructed budget line. 

Can the construction of such a budget line always be carried out? Un¬ 
fortunately, the answer is no. Figure 31.8 gives an example. Here the 
illustrated point X is Pareto efficient, but there are no prices at which A 
and B will want to consume at point X. The most obvious candidate is 
drawn in the diagram, but the optimal demands of agents A and B don’t co¬ 
incide for that budget. Agent A wants to demand the bundle Y, but agent 
B wants the bundle X —demand does not equal supply at these prices. 

The difference between Figure 31.7 and Figure 31.8 is that the preferences 
in Figure 31.7 are convex while the ones in Figure 31.8 are not. If the 
preferences of both agents are convex, then the common tangent will not 
intersect either indifference curve more than once, and everything will work 
out fine. This observation gives us the Second Theorem of Welfare 
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Figure 

31.8 


Person 



A Pareto efficient allocation that is not an equilibrium. 

It is possible to find Pareto efficient allocations such as X in 
this diagram that cannot be achieved by competitive markets if 
preferences are nonconvex. 


Economics: if all agents have convex preferences, then there will always 
be a set of prices such that each Pareto efficient allocation is a market 
equilibrium for an appropriate assignment of endowments. 

The proof is essentially the geometric argument we gave above. At a 
Pareto efficient allocation, the bundles preferred by agent A and by agent 
B must be disjoint. Thus if both agents have convex preferences we can 
draw a straight line between the two sets of preferred bundles that separates 
one from the other. The slope of this line gives us the relative prices, and 
any endowment that puts the two agents on this line will lead to the final 
market equilibrium being the original Pareto efficient allocation. 


31.12 Implications of the First Welfare Theorem 

The two theorems of welfare economics are among the most fundamental 
results in economics. We have demonstrated the theorems only in the sim¬ 
ple Edgeworth box case, but they are true for much more complex models 
with arbitrary numbers of consumers and goods. The welfare theorems 
have profound implications for the design of ways to allocate resources. 

Let us consider the First Welfare Theorem. This says that any compet¬ 
itive equilibrium is Pareto efficient. There are hardly any explicit assump- 
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tions in this theorem—it follows almost entirely from the definitions. But 
there are some implicit assumptions. One major assumption is that agents 
only care about their own consumption of goods, and not about what other 
agents consume. If one agent does care about another agent’s consumption, 
we say that there is a consumption externality. We shall see that when 
consumption externalities are present, a competitive equilibrium need not 
be Pareto efficient. 

To take a simple example, suppose that agent A cares about agent B’s 
consumption of cigars. Then there is no particular reason why each agent 
choosing his or her own consumption bundle at the market prices will result 
in a Pareto efficient allocation. After each person has purchased the best 
bundle he or she can afford, there may still be ways to make both of them 
better off—such as A paying B to smoke fewer cigars. We will discuss 
externalities in more detail in Chapter 33. 

Another important implicit assumption in the First Welfare Theorem 
is that agents actually behave competitively. If there really were only 
two agents, as in the Edgeworth box example, then it is unlikely that 
they would each take price as given. Instead, the agents would probably 
recognize their market power and would attempt to use their market power 
to improve their own positions. The concept of competitive equilibrium 
only makes sense when there are enough agents to ensure that each behaves 
competitively. 

Finally, the First Welfare Theorem is only of interest if a competitive 
equilibrium actually exists. As we have argued above, this will be the case 
if the consumers are sufficiently small relative to the size of the market. 

Given these provisos, the First Welfare Theorem is a pretty strong result: 
a private market, with each agent seeking to maximize his or her own utility, 
will result in an allocation that achieves Pareto efficiency. 

The importance of the First Welfare Theorem is that it gives a general 
mechanism—the competitive market—that we can use to ensure Pareto 
efficient outcomes. If there are only two agents involved, this doesn’t mat¬ 
ter very much; it is easy for two people to get together and examine the 
possibilities for mutual trades. But if there are thousands, or even mil¬ 
lions, of people involved there must be some kind of structure imposed on 
the trading process. The First Welfare Theorem shows that the particular 
structure of competitive markets has the desirable property of achieving a 
Pareto efficient allocation. 

If we are dealing with a resource problem involving many people, it is 
important to note that the use of competitive markets economizes on the 
information that any one agent needs to possess. The only things that a 
consumer needs to know to make his consumption decisions are the prices 
of the goods he is considering consuming. Consumers don’t need to know 
anything about how the goods are produced, or who owns what goods, or 
where the goods come from in a competitive market. If each consumer 
knows only the prices of the goods, he can determine his demands, and if 
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the market functions well enough to determine the competitive prices, we 
are guaranteed an efficient outcome. The fact that competitive markets 
economize on information in this way is a strong argument in favor of their 
use as a way to allocate resources. 


31.13 Implications of the Second Welfare Theorem 

The Second Theorem of Welfare Economics asserts that under certain con¬ 
ditions, every Pareto efficient allocation can be achieved as a competitive 
equilibrium. 

What is the meaning of this result? The Second Welfare Theorem implies 
that the problems of distribution and efficiency can be separated. What¬ 
ever Pareto efficient allocation you want can be supported by the market 
mechanism. The market mechanism is distributionally neutral; whatever 
your criteria for a good or a just distribution of welfare, you can use com¬ 
petitive markets to achieve it. 

Prices play two roles in the market system: an allocative role and a 
distributive role. The allocative role of prices is to indicate relative scarcity; 
the distributive role is to determine how much of different goods different 
agents can purchase. The Second Welfare Theorem says that these two roles 
can be separated: we can redistribute endowments of goods to determine 
how much wealth agents have, and then use prices to indicate relative 
scarcity. 

Policy discussions often become confused on this point. One often hears 
arguments for intervening in pricing decisions on grounds of distributional 
equity. However, such intervention is typically misguided. As we have seen 
above, a convenient way to achieve efficient allocations is for each agent 
to face the true social costs of his or her actions and to make choices that 
reflect those costs. Thus in a perfectly competitive market the marginal 
decision of whether to consume more or less of some good will depend 
on the price—which measures how everyone else values this good on the 
margin. The considerations of efficiency are inherently marginal decisions— 
each person should face the correct marginal tradeoff in making his or her 
consumption decisions. 

The decision about how much different agents should consume is a totally 
different issue. In a competitive market this is determined by the value of 
the resources that a person has to sell. From the viewpoint of the pure 
theory, there is no reason why the state can’t transfer purchasing power— 
endowments—among consumers in any way that is seen fit. 

In fact the state doesn’t need to transfer the physical endowments them¬ 
selves. All that is necessary is to transfer the purchasing power of the 
endowment. The state could tax one consumer on the basis of the value of 
his endowment and transfer this money to another. As long as the taxes 
are based on the value of the consumer’s endowment of goods there will 
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be no loss of efficiency. It is only when taxes depend on the choices that a 
consumer makes that inefficiencies result, since in this case, the taxes will 
affect the consumer’s marginal choices. 

It is true that a tax on endowments will generally change people’s behav¬ 
ior. But, according to the First Welfare Theorem, trade from any initial 
endowments will result in a Pareto efficient allocation. Thus no matter how 
one redistributes endowments, the equilibrium allocation as determined by 
market forces will still be Pareto efficient. 

However, there are practical matters involved. It would be easy to have 
a lump-sum tax on consumers. We could tax all consumers with blue eyes, 
and redistribute the proceeds to consumers with brown eyes. As long as 
eye color can’t be changed, there would be no loss in efficiency. Or we could 
tax consumers with high IQs and redistribute the funds to consumers with 
low IQs. Again, as long as IQ can be measured, there is no efficiency loss 
in this kind of tax. 

But there’s the problem. How do we measure people’s endowment of 
goods? For most people, the bulk of their endowment consists of their 
own labor power. People’s endowments of labor consist of the labor that 
they could consider selling, not the amount of labor that they actually 
end up selling. Taxing labor that people decide to sell to the market is a 
distortionary tax. If the sale of labor is taxed, the labor supply decision 
of consumers will be distorted—they will likely supply less labor than they 
would have supplied in the absence of a tax. Taxing the potential value of 
labor -the endowment of labor—is not distortionary. The potential value 
of labor is, by definition, something that is not changed by taxation. Taxing 
the value of the endowment sounds easy until we realize that it involves 
identifying and taxing something that might be sold, rather than taxing 
something that is sold. 

We could imagine a mechanism for levying this kind of tax. Suppose 
that we considered a society where each consumer was required to give 
the money earned in 10 hours of his labor time to the state each week. 
This kind of tax would be independent of how much the person actually 
worked—it would only depend on the endowment of labor, not on how 
much was actually sold. Such a tax is basically transferring some part of 
each consumer’s endowment of labor time to the state. The state could 
then use these funds to provide various goods, or it could simply transfer 
these funds to other agents. 

According to the Second Welfare Theorem, this kind of lump-sum taxa¬ 
tion would be nondistortionary. Essentially any Pareto efficient allocation 
could be achieved by such lump-sum redistribution. 

However, no one is advocating such a radical restructuring of the tax 
system. Most people’s labor supply decisions are relatively insensitive to 
variations in the wage rate, so the efficiency loss from taxing labor may not 
be too large anyway. But the message of the Second Welfare Theorem is 
important. Prices should be used to reflect scarcity. Lump-sum transfers of 
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wealth should be used to adjust for distributional goals. To a large degree, 
these two policy decisions can be separated. 

People’s concern about the distribution of welfare can lead them to ad¬ 
vocate various forms of manipulation of prices. It has been argued, for 
example, that senior citizens should have access to less expensive telephone 
service, or that small users of electricity should pay lower rates than large 
users. These are basically attempts to redistribute income through the 
price system by offering some people lower prices than others. 

When you think about it this is a terribly inefficient way to redistribute 
income. If you want to redistribute income, why don’t you simply redis¬ 
tribute income? If you give a person an extra dollar to spend, then he can 
choose to consume more of any of the goods that he wants to consume—not 
necessarily just the good being subsidized. 


Summary 

1. General equilibrium refers to the study of how the economy can adjust 
to have demand equal supply in all markets at the same time. 

2. The Edgeworth box is a graphical tool to examine such a general equi¬ 
librium with 2 consumers and 2 goods. 

3. A Pareto efficient allocation is one in which there is no feasible reallo¬ 
cation of the goods that would make all consumers at least as well-off and 
at least one consumer strictly better off. 

4. Walras’ law states that the value of aggregate excess demand is zero for 
all prices. 

5. A general equilibrium allocation is one in which each agent chooses a 
most preferred bundle of goods from the set of goods that he or she can 
afford. 

6. Only relative prices are determined in a general equilibrium system. 

7. If the demand for each good varies continuously as prices vary, then 
there will always be some set of prices where demand equals supply in 
every market; that is, a competitive equilibrium. 

8. The First Theorem of Welfare Economics states that a competitive equi¬ 
librium is Pareto efficient. 

9. The Second Theorem of Welfare Economics states that as long as pref¬ 
erences are convex, then every Pareto efficient allocation can be supported 
as a competitive equilibrium. 
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REVIEW QUESTIONS 


1. Is it possible to have a Pareto efficient allocation where someone is worse 
off than he is at an allocation that is not Pareto efficient? 

2. Is it possible to have a Pareto efficient allocation where everyone is worse 
off than they are at an allocation that is not Pareto efficient? 

3. True or false? If we know the contract curve, then we know the outcome 
of any trading. 

4. Can some individual be made better off if we are at a Pareto efficient 
allocation? 

5. If the value of excess demand in 8 out of 10 markets is equal to zero, 
what must be true about the remaining two markets? 

APPENDIX 

Let us examine the calculus conditions describing Pareto efficient allocations. By 
definition, a Pareto efficient allocation makes each agent as well-off as possible, 
given the utility of the other agent. So let us pick u as the utility level for agent 
B, say, and see how we can make agent A as well-off as possible. 

The maximization problem is 


max ua{x\,x 2 a) 

X A’ X A' X B’ X B 

such that ub (= u 

i i i i 

x A + x B =oo 

2.2 2 
X A + X B = Ul . 

Here co 1 = Ua+oo b is the total amount of good 1 available and u> 2 = loa+oo b is 
the total amount of good 2 available. This maximization problem asks us to find 
the allocation {x\, x\, x B ,x%) that makes person A’s utility as large as possible, 
given a fixed level for person B’s utility, and given that the total amount of each 
good used is equal to the amount available. 

We can write the Lagrangian for this problem as 

L = ua{,x\,x 2 a) - \{u b (x b ,x 2 b ) -u) 

- Hi(xa + X B - U! 1 ) - M2 (xa +X 2 B - LO 2 ). 

Here A is the Lagrange multiplier on the utility constraint, and the /it’s are 
the Lagrange multipliers on the resource constraints. When we differentiate with 
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respect to each of the goods, we have four first-order conditions that must hold 
at the optimal solution: 


dL du A „ 

—r = —i-Mi — U 

dx\ dx\ 


dL 

dx\ 


dlLA 

^ - M2 = 0 
OX A 


dL du B 

— T~ = —A—i iii = 0 

dx B dx B 


dL 
dx | 


= -A 


diis _ 

2 M2 0. 

dx B 


If we divide the first equation by the second, and the third equation by the 
fourth, we have 

MRS a = 


MRS B = 


duA/dxj i 
du A /dx 2 A 

~ m 

M2 

(31.5) 

du B /dx B 

Mi 

(31.6) 

du B /dx\ 

M2 ' 


The interpretation of these conditions is given in the text: at a Pareto efficient 
allocation, the marginal rates of substitution between the two goods must be the 
same. Otherwise, there would be some trade that would make each consumer 
better off. 

Let us recall the conditions that must hold for optimal choice by consumers. If 
consumer A is maximizing utility subject to her budget constraint and consumer 
B is maximizing utility subject to his budget constraint, and both consumers face 
the same prices for goods 1 and 2, we must have 


du A /dx\ _ pi 
du A / dx\ M2 


du B /dx B _ mi , . 

du B /dXg p2 

Note the similarity with the efficiency conditions. The Lagrange multipliers in the 
efficiency conditions, pi and /i 2 , are just like the prices Mi an d M 2 in the consumer 
choice conditions. In fact the Lagrange multipliers in this kind of problem are 
sometimes known as shadow prices or efficiency prices. 

Every Pareto efficient allocation has to satisfy conditions like those in equations 
(31.5) and (31.6). Every competitive equilibrium has to satisfy conditions like 
those in equations (31.7) and (31.8). The conditions describing Pareto efficiency 
and the conditions describing individual maximization in a market environment 
are virtually the same. 






CHAPTER 



PRODUCTION 


In the last chapter we described a general equilibrium model of a pure 
exchange economy and discussed issues of resource allocation when a fixed 
amount of each good was available. In this chapter we want to describe how 
production fits into the general equilibrium framework. When production 
is possible, the amounts of the goods are not fixed but will respond to 
market prices. 

If you thought the two-consumer two-good assumption was a restrictive 
framework in which to examine trade, imagine what production is going to 
look like! The minimal set of players that we can have to make an inter¬ 
esting problem is one consumer, one firm, and two goods. The traditional 
name for this economic model is the Robinson Crusoe economy, after 
Defoe’s shipwrecked hero. 


32.1 The Robinson Crusoe Economy 

In this economy Robinson Crusoe plays a dual role: he is both a consumer 
and a producer. Robinson can spend his time loafing on the beach thereby 
consuming leisure, or he can spend time gathering coconuts. The more 
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coconuts he gathers the more he has to eat, but the less time he has to 
improve his tan. 

Robinson’s preferences for coconuts and leisure are depicted in Fig¬ 
ure 32.1. They are just like the preferences for leisure and consumption 
depicted in Chapter 9, except we are measuring labor on the horizontal 
axis rather than leisure. So far nothing new has been added. 


Figure 

32.1 



The Robinson Crusoe economy. The indifference curves 
depict Robinson’s preferences for coconuts and leisure. The pro¬ 
duction function depicts the technological relationship between 
the amount he works and the amount of coconuts he produces. 


Now let’s draw in the production function, the function that illus¬ 
trates the relationship between how much Robinson works and how many 
coconuts he gets. This will typically have the shape depicted in Figure 32.1. 
The more Robinson works, the more coconuts he will get; but, due to di¬ 
minishing returns to labor, the marginal product of his labor declines: the 
number of extra coconuts that he gets from an extra hour’s labor decreases 
as the hours of labor increase. 

How much does Robinson work and how much does he consume? To 
answer these questions, look for the highest indifference curve that just 
touches the production set. This will give the most-preferred combination 








CRUSOE, INC. 611 


of labor and consumption that Robinson can get, given the technology for 
gathering coconuts that he is using. 

At this point, the slope of the indifference curve must equal the slope of 
the production function by the standard argument: if they crossed, there 
would be some other feasible point that was preferred. This means that the 
marginal product of an extra hour of labor must equal the marginal rate 
of substitution between leisure and coconuts. If the marginal product were 
greater than the marginal rate of substitution, it would pay for Robinson 
to give up a little leisure in order to get the extra coconuts. If the marginal 
product were less than the marginal rate of substitution, it would pay for 
Robinson to work a little less. 


32.2 Crusoe, Inc. 

So far this story is only a slight extension of models we have already seen. 
But now let’s add a new feature. Suppose that Robinson is tired of si¬ 
multaneously being a producer and consumer and that he decides to al¬ 
ternate roles. One day he will behave entirely as a producer, and the 
next day he will behave entirely as a consumer. In order to coordinate 
these activities, he decides to set up a labor market and a coconut mar¬ 
ket. 

He also sets up a firm, Crusoe, Inc., and becomes its sole shareholder. 
The firm is going to look at the prices for labor and coconuts and decide 
how much labor to hire and how many coconuts to produce, guided by 
the principle of profit maximization. Robinson, in his role as a worker, 
is going to collect income from working at the firm; in his role as share¬ 
holder in the firm he will collect profits; and, in his role as consumer he 
will decide how much to purchase of the firm’s output. (No doubt this 
sounds peculiar, but there really isn’t that much else to do on a desert 
island.) 

In order to keep track of his transactions, Robinson invents a currency 
he calls “dollars,” and he chooses, somewhat arbitrarily, to set the price of 
coconuts at one dollar apiece. Thus coconuts are the numeraire good for 
this economy; as we’ve seen in Chapter 2, a numeraire good is one whose 
price has been set to one. Since the price of coconuts is normalized at one, 
we have only to determine the wage rate. What should his wage rate be in 
order to make this market work? 

We’re going to think about this problem first from the viewpoint of 
Crusoe, Inc., and then from the viewpoint of Robinson, the consumer. The 
discussion is a little schizophrenic at times, but that’s what you have to 
put up with if you want to have an economy with only one person. We’re 
going to look at the economy after it has been running along for some time, 
and everything is in equilibrium. In equilibrium, the demand for coconuts 
will equal the supply of coconuts and the demand for labor will equal the 
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supply of labor. Both Crusoe, Inc. and Robinson the consumer will be 
making optimal choices given the constraints they face. 


32.3 The Firm 

Each evening, Crusoe, Inc. decides how much labor it wants to hire the 
next day, and how many coconuts it wants to produce. Given a price of 
coconuts of 1 and a wage rate of labor of w, we can solve the firm’s profit- 
maximization problem in Figure 32.2. We first consider all combinations 
of coconuts and labor that yield a constant level of profits, n. This means 
that 

tv = C — wL. 


Solving for C, we have 

C = 7T T wL. 

Just as in Chapter 19, this formula describes the isoprofit lines—all com¬ 
binations of labor and coconuts that yield profits of 7r. Crusoe, Inc. will 
choose a point where the profits are maximized. As usual, this implies a 
tangency condition: the slope of the production function—the marginal 
product of labor—must equal w, as illustrated in Figure 32.2. 



Profit maximization. Crusoe, Inc. chooses a production plan 
that maximizes profits. At the optimal point the production 
function must be tangent to an isoprofit line. 


Figure 

32.2 
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Thus the vertical intercept of the isoprofit line measures the maximal 
level of profits measured in units of coconuts: if Robinson generates n* 
dollars of profit, this money can buy tt* coconuts, since the price of coconuts 
has been chosen to be 1. There we have it. Crusoe, Inc. has done its job. 
Given the wage w, it has determined how much labor it wants to hire, how 
many coconuts it wants to produce, and what profits it will generate by 
following this plan. So Crusoe, Inc. declares a stock dividend of 7r* dollars 
and mails it off to its sole shareholder, Robinson. 


32.4 Robinson's Problem 

The next day Robinson wakes up and receives his dividend of tt* dollars. 
While eating his coconut breakfast, he contemplates how much he wants 
to work and consume. He may consider just consuming his endowment— 
spend his profits on tt* coconuts and consume his endowment of leisure. 
But listening to his stomach growl is not so pleasant, and it might make 
sense to work for a few hours instead. So Robinson trudges down to Crusoe, 
Inc. and starts to gather coconuts, just as he has done every other day. 

We can describe Robinson’s labor-consumption choice using standard in¬ 
difference curve analysis. Plotting labor on the horizontal axis and coconuts 
on the vertical axis, we can draw in an indifference curve as illustrated in 
Figure 32.3. 

Since labor is a bad, by assumption, and coconuts are a good, the indif¬ 
ference curve has a positive slope as shown in the diagram. If we indicate 
the maximum amount of labor by L , then the distance from L to the chosen 
supply of labor gives Robinson’s demand for leisure. This is just like the 
supply of labor model examined in Chapter 9, except we have reversed the 
origin on the horizontal axis. 

Robinson’s budget line is also illustrated in Figure 32.3. It has a slope of 
w and passes through his endowment point (7r*,0). (Robinson has a zero 
endowment of labor and a n* endowment of coconuts since that would be 
his bundle if he engaged in no market transactions.) Given the wage rate, 
Robinson chooses optimally how much he wants to work and how many 
coconuts he wants to consume. At his optimal consumption, the marginal 
rate of substitution between consumption and leisure must equal the wage 
rate, just as in a standard consumer choice problem. 


32.5 Putting Them Together 

Now we superimpose Figures 32.2 and 32.3 to get Figure 32.4. Look at 
what has happened! Robinson’s bizarre behavior has worked out all right 
after all. He ends up consuming at exactly the same point as he would 
have if he had made all the decisions at once. Using the market system 
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Figure 

32.3 



Robinson’s maximization problem. Robinson the con¬ 
sumer decides how much to work and consume given the prices 
and wages. The optimal point will occur where the indifference 
curve is tangent to the budget line. 


results in the same outcome as choosing the consumption and production 
plans directly. 

Since the marginal rate of substitution between leisure and consump¬ 
tion equals the wage, and the marginal product of labor equals the wage, 
we are assured that the marginal rate of substitution between labor and 
consumption equals the marginal product—that is, that the slopes of the 
indifference curve and the production set are the same. 

In the case of a one-person economy, using the market is pretty silly. Why 
should Robinson bother to break up his decision into two pieces? But in an 
economy with many people, breaking up decisions no longer seems so odd. 
If there are many firms, then questioning each person about how much they 
want of each good is simply impractical. In a market economy the firms 
simply have to look at the prices of goods in order to make their production 
decisions. For the prices of goods measure how much the consumers value 
extra units of consumption. And the decision that the firms face, for the 
most part, is whether they should produce more or less output. 

The market prices reflect the marginal values of the goods that the firms 
use as inputs and outputs. If firms use the change in profits as a guide 
to production, where the profits are measured at market prices, then their 
decisions will reflect the marginal values that consumers place on the goods. 
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Equilibrium in both consumption and production. The 

amount of coconuts demanded by the consumer Robinson equals 
the amount of coconuts supplied by Crusoe, Inc. 


Figure 

32.4 


32.6 Different Technologies 

In the above discussion we have assumed that the technology available to 
Robinson exhibited diminishing returns to labor. Since labor was the only 
input to production, this was equivalent to decreasing returns to scale. 
(This is not necessarily true if there is more than one input!) 

It is useful to consider some other possibilities. Suppose, for example, 
that the technology exhibited constant returns to scale. Recall that con¬ 
stant returns to scale means that using twice as much of all inputs produces 
twice as much output. In the case of a one-input production function, this 
means that the production function must be a straight line through the 
origin as depicted in Figure 32.5. 

Since the technology has constant returns to scale, the argument in Chap¬ 
ter 19 implies that the only reasonable operating position for a competitive 
firm is at zero profits. This is because if the profits were ever greater than 
zero, it would pay for the firm to expand output indefinitely, and if profits 
were ever less than zero, it would pay the firm to produce zero output. 

Thus Robinson’s endowment involves zero profits and L , his initial en¬ 
dowment of labor time. His budget set coincides with the production set, 
and the story is much the same as before. 

The situation is somewhat different with an increasing returns to scale 
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Figure 

32.5 



Constant returns to scale. If the technology exhibits con¬ 
stant returns to scale, Crusoe, Inc., makes zero profits. 


technology, as depicted in Figure 32.6. There is no difficulty in this simple 
example in exhibiting the optimal choice of consumption and leisure for 
Robinson. The indifference curve will be tangent to the production set 
as usual. The problem arises in trying to support this point as a profit- 
maximizing point. For if the firm were faced with the prices given by 
Robinson’s marginal rate of substitution, it would want to produce more 
output than Robinson would demand. 

If the firm exhibits increasing returns to scale at the optimal choice, 
then the average costs of production will exceed the marginal costs of 
production—and that means that the firm will be making negative profits. 
The goal of profit maximization would lead the firm to want to increase 
its output—but this would be incompatible with the demands for output 
and the supplies of inputs from the consumers. In the case depicted, there 
is no price at which the utility-maximizing demand by the consumer will 
equal the profit-maximizing supply from the firm. 

Increasing returns to scale is an example of a nonconvexity. In this 
case the production set—the set of coconuts and labor that are feasible 
for the economy—is not a convex set. Thus the common tangent to the 
indifference curve and the production function at the point ( L*,C *) in 
Figure 32.6 will not separate the preferred points from the feasible points 
as it does in Figure 32.4. 

Nonconvexities such as this pose grave difficulties for the functioning of 
competitive markets. In a competitive market consumers and firms look 
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Increasing returns to scale. The production set exhibits 
increasing returns to scale and the Pareto efficient allocation 
cannot be achieved by a competitive market. 


at just one set of numbers—the market prices—to determine their con¬ 
sumption and production decisions. If the technology and the preferences 
are convex, then the only things that the economic agents need to know 
to make efficient decisions are the relationship between the prices and the 
marginal rates of substitution near the points where the economy is cur¬ 
rently producing: the prices tell the agents everything that is necessary in 
order to determine an efficient allocation of resources. 

But if the technology and/or the preferences are nonconvex, then the 
prices do not convey all the information necessary in order to choose an 
efficient allocation. Information about the slopes of the production function 
and indifference curves far away from the current operating position is also 
necessary. 

However, these observations apply only when the returns to scale are 
large relative to the size of the market. Small regions of increasing returns 
to scale do not pose undue difficulties for a competitive market. 


32.7 Production and the First Welfare Theorem 

Recall that in the case of a pure exchange economy, a competitive equi¬ 
librium is Pareto efficient. This fact is known as the First Theorem of 
Welfare Economics. Does the same result hold in an economy with pro¬ 
duction? The diagrammatic approach used above is not adequate to answer 


Figure 

32.6 
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this question, but a generalization of the algebraic argument we provided 
in Chapter 31 does nicely. It turns out that the answer is yes: if all firms 
act as competitive profit maximizers, then a competitive equilibrium will 
be Pareto efficient. 

This result has the usual caveats. First, it has nothing to do with dis¬ 
tribution. Profit maximization only ensures efficiency, not justice! Second, 
this result only makes sense when a competitive equilibrium actually exists. 
In particular, this will rule out large areas of increasing returns to scale. 
Third, the theorem implicitly assumes that the choices of any one firm do 
not affect the production possibilities of other firms. That is, it rules out 
the possibility of production externalities. Similarly, the theorem re¬ 
quires that firms’ production decisions do not directly affect the consump¬ 
tion possibilities of consumers; that is, that there are no consumption 
externalities. More precise definitions of externalities will be given in 
Chapter 33, where we will examine their effect on efficient allocations in 
more detail. 


32.8 Production and the Second Welfare Theorem 

In the case of a pure exchange economy, every Pareto efficient allocation 
is a possible competitive equilibrium, as long as consumers exhibit convex 
preferences. In the case of an economy involving production, the same 
result is true, but now we require not only that consumers’ preferences are 
convex, but also that firms’ production sets are convex. As discussed above, 
this requirement effectively rules out the possibility of increasing returns 
to scale: if firms have increasing returns to scale at the equilibrium level of 
production, they would want to produce more output at the competitive 
prices. 

However, with constant or decreasing returns to scale, the Second Wel¬ 
fare Theorem works fine. Any Pareto efficient allocation can be achieved 
through the use of competitive markets. Of course in general it will be 
necessary to redistribute endowments among the consumers to support 
different Pareto efficient allocations. In particular, both the income from 
endowments of labor and ownership shares of the firm will have to be redis¬ 
tributed. As indicated in the last chapter, there may be significant practical 
difficulties involved with this sort of redistribution. 


32.9 Production Possibilities 

We have now seen how production and consumption decisions can be made 
in a one-input, one-output economy. In this section we want to explore 
how this model can be generalized to an economy with several inputs and 
outputs. Although we will deal only with the two-good case, the concepts 
will generalize naturally to many goods. 
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So let us suppose that there is some other good that Robinson might 
produce—say fish. He can devote his time to gathering coconuts or to fish¬ 
ing. In Figure 32.7 we have depicted the various combinations of coconuts 
and fish that Robinson can produce from devoting different amounts of time 
to each activity. This set is known as a production possibilities set. 
The boundary of the production possibilities set is called the production 
possibilities frontier. This should be contrasted with the production 
function discussed earlier that depicts the relationship between the input 
good and the output good; the production possibilities set depicts only the 
set of output goods that is feasible. (In more advanced treatments, both 
inputs and outputs can be considered part of the production possibilities 
set, but these treatments cannot easily be handled with two-dimensional 
diagrams.) 



A production possibilities set. The production possibili¬ 
ties set measures the set of outputs that are feasible given the 
technology and the amounts of inputs. 


Figure 

32.7 


The shape of the production possibilities set will depend on the nature 
of the underlying technologies. If the technologies for producing coconuts 
and fish exhibit constant returns to scale the production possibilities set 
will take an especially simple form. Since by assumption there is only one 
input to production—Robinson’s labor -the production functions for fish 
and coconuts will be simply linear functions of labor. 

For example, suppose that Robinson can produce 10 pounds of fish per 
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hour or 20 pounds of coconuts per hour. Then if he devotes Lf hours 
to fish production and L c hours to coconut production, he will produce 
10 Lf pounds of fish and 20L C pounds of coconuts. Suppose that Robinson 
decides to work 10 hours a day. Then the production possibilities set will 
consist of all combinations of coconuts, C, and fish, F, such that 


F = 10 L f 
C = 20 L c 
L c -T Lf = 10. 


The first two equations measure the production relationships, and the 
third measures the resource constraint. To determine the production pos¬ 
sibilities frontier solve the first two equations for Lf and L c to get 


L ' = f» 


Now add these two equations together, and use the fact that Lf + L c = 10 
to find 


F C 
10 + 20 


= 10 . 


This equation gives us all the combinations of fish and coconuts that Robin¬ 
son can produce if he works 10 hours a day. It is depicted in Figure 32.8A. 

The slope of this production possibilities set measures the marginal 
rate of transformation —how much of one good Robinson can get if he 
decides to sacrifice some of the other good. If Robinson gives up enough 
labor to produce 1 less pound of fish, he will be able to get 2 more pounds 
of coconuts. Think about it: if Robinson works one hour less on fish 
production, he will get 10 pounds less fish. But then if he devotes that 
time to coconuts, he will get 20 pounds more coconuts. The tradeoff is at 
a ratio of 2 to 1. 


32.10 Comparative Advantage 

The construction of the production possibilities set given above was quite 
simple since there was only one way to produce fish and one way to produce 
coconuts. What if there is more than one way to produce each good? Sup¬ 
pose that we add another worker to our island economy, who has different 
skills in producing fish and coconuts. 

To be specific, let us call the new worker Friday, and suppose that he 
can produce 20 pounds of fish per hour, or 10 pounds of coconuts per hour. 
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Thus if Friday works for 10 hours, his production possibilities set will be 
determined by 

F = 20L/ 

C = 10 L c 
L c + Lf = 10. 

Doing the same sort of calculations as we did for Robinson, Friday’s 
production possibilities set is given by 


This is depicted in Figure 32.8B. Note that the marginal rate of transfor¬ 
mation between coconuts and fish for Friday is AC/AF = —1/2, whereas 
for Robinson the marginal rate of transformation is —2. For every pound 
of coconuts that Friday gives up, he can get two pounds of fish; for every 
pound of fish that Robinson gives up, he can get two pounds of coconuts. 
In this circumstance we say that Friday has a comparative advantage 
in fish production, and Robinson has a comparative advantage in coconut 
production. In Figure 32.8 we have depicted three production possibilities 
sets: Panel A shows Robinson’s, panel B shows Friday’s, and panel C de¬ 
picts the joint production possibilities set—how much of each good could 
be produced in total by both people. 
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Joint production possibilities. Robinson’s and Friday’s pro¬ 
duction possibilities sets and the joint production possibilities 
set. 


Figure 

32.8 


The joint production possibilities set combines the best of both workers. 
If both workers are used entirely to produce coconuts, we will get 300 
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coconuts—100 from Friday and 200 from Robinson. If we want to get 
more fish, it makes sense to shift the person who is most productive at 
fish—Friday—out of coconut production and into fish production. For 
each pound of coconuts that Friday doesn’t produce we get 2 pounds of 
fish; thus the slope of the joint production possibilities set is —1/2—which 
is exactly Friday’s marginal rate of transformation. 

When Friday is producing 200 pounds of fish, he is fully occupied. If we 
want even more fish, we have to switch to using Robinson. From this point 
on the joint production possibilities set will have a slope of —2, since we 
will be operating along Robinson’s production possibilities set. Finally, if 
we want to produce as much fish as possible, both Robinson and Friday 
concentrate on fish production and we get 300 pounds of fish, 200 from 
Friday, and 100 from Robinson. 

Since the workers each have a comparative advantage in different goods, 
the joint production possibilities set will have a “kink,” as shown in Figure 
32.8. There is only one kink in this example since there are just two different 
ways to produce output—Crusoe’s way and Friday’s way. If there are many 
different ways to produce output, the production possibilities set will have 
the more typical “rounded” structure, as depicted in Figure 32.7. 


32.11 Pareto Efficiency 

In the last two sections we saw how to construct the production possibilities 
set, the set that describes the feasible consumption bundles for the economy 
as a whole. Here we consider Pareto efficient ways to choose among the 
feasible consumption bundles. 

We will indicate aggregate consumption bundles by (X^X 2 ). This in¬ 
dicates that there are X 1 units of good 1 and X 2 units of good 2 that 
are available for consumption. In the Crusoe/Friday economy, the two 
goods are coconuts and fish, but we will use the (A^ 1 , X 2 ) notation in order 
to emphasize the similarities with the analysis in Chapter 31. Once we 
know the total amount of each good, we can draw an Edgeworth box as in 
Figure 32.9. 

Given (X 1 , A 2 ), the set of Pareto efficient consumption bundles will be 
the same sort as those examined in the last chapter: the Pareto efficient 
consumption levels will lie along the Pareto set—the line of mutual tan- 
gencies of the indifference curves, as illustrated in Figure 32.9. These are 
the allocations in which each consumer’s marginal rate of substitution—the 
rate at which he or she is just willing to trade—equals that of the other. 

These allocations are Pareto efficient as far as the consumption decisions 
are concerned. If people can simply trade one good for another, the Pareto 
set describes the set of bundles that exhausts the gains from trade. But in 
an economy with production, there is another way to “exchange” one good 
for another—namely, to produce less of one good and more of another. 
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Production and the Edgeworth box. At each point on the 
production possibilities frontier, we can draw an Edgeworth box 
to illustrate the possible consumption allocations. 


Figure 

32.9 


The Pareto set describes the set of Pareto efficient bundles given the 
amounts of goods 1 and 2 available, but in an economy with production 
those amounts can themselves be chosen out of the production possibilities 
set. Which choices from the production possibilities set will be Pareto 
efficient choices? 

Let us think about the logic underlying the marginal rate of substitution 
condition. We argued that in a Pareto efficient allocation, the MRS of 
consumer A had to be equal to the MRS of consumer B: the rate at which 
consumer A would just be willing to trade one good for the other should 
be equal to the rate at which consumer B would just be willing to trade 
one good for the other. If this were not true, then there would be some 
trade that would make both consumers better off. 

Recall that the marginal rate of transformation (MRT) measures the 
rate at which one good can be “transformed” into the other. Of course, 
one good really isn’t being literally transformed into the other. Rather the 
factors of production are being moved around so as to produce less of one 
good and more of the other. 

Suppose that the economy were operating at a position where the mar¬ 
ginal rate of substitution of one of the consumers was not equal to the 
marginal rate of transformation between the two goods. Then such a po¬ 
sition cannot be Pareto efficient. Why? Because at this point, the rate at 
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which the consumer is willing to trade good 1 for good 2 is different from 
the rate at which good 1 can be transformed into good 2—there is a way 
to make the consumer better off by rearranging the pattern of production. 

Suppose, for example, that the consumer’s MRS is 1; the consumer is 
just willing to substitute good 1 for good 2 on a one-to-one basis. Suppose 
that the MRT is 2, which means that giving up one unit of good 1 will 
allow society to produce two units of good 2. Then clearly it makes sense 
to reduce the production of good 1 by one unit; this will generate two extra 
units of good 2. Since the consumer was just indifferent between giving up 
one unit of good 1 and getting one unit of the other good in exchange, he 
or she will now certainly be better off by getting two extra units of good 2. 

The same argument can be made whenever one of the consumers has a 
MRS that is different from the MRT—there will always be a rearrangement 
of consumption and production that will make that consumer better off. We 
have already seen that for Pareto efficiency each consumer’s MRS should 
be the same, and the argument given above implies that each consumer’s 
MRS should in fact be equal to the MRT. 

Figure 32.9 illustrates a Pareto efficient allocation. The MRSs of each 
consumer are the same, since their indifference curves are tangent in the 
Edgeworth box. And each consumer’s MRS is equal to the MRT—the slope 
of the production possibilities set. 


32.12 Castaways, Inc. 

In the last section we derived the necessary conditions for Pareto effi¬ 
ciency: the MRS of each consumer must equal the MRT. Any way of 
distributing resources that results in Pareto efficiency must satisfy this 
condition. Earlier in this chapter, we claimed that a competitive economy 
with profit-maximizing firms and utility-maximizing consumers would re¬ 
sult in a Pareto efficient allocation. In this section we explore the details 
of how this works. 

Our economy now contains two individuals, Robinson and Friday. There 
are four goods: two factors of production (Robinson’s labor and Friday’s 
labor) and two output goods (coconuts and fish). Let us suppose that 
Robinson and Friday are both shareholders of the firm, which we will now 
refer to as Castaways, Inc. Of course, they are also the sole employees 
and customers, but as usual we shall examine each role in turn, and not 
allow the participants to see the wider picture. After all, the object of the 
analysis is to understand how a decentralized resource allocation system 
works—one in which each person only has to determine his or her own 
decisions, without regard for the functioning of the economy as a whole. 

Start first with Castaways, Inc., and consider the profit-maximization 
problem. Castaways, Inc., produces two outputs, coconuts (C) and fish 
(F), and it uses two kinds of labor, Crusoe’s labor (Lq) and Friday’s labor 
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(Lp). Given the price of coconuts (pc), the price of fish (pf), and the wage 
rates of Crusoe and Friday (wq and wp), the profit-maximization problem 
is 

max pnC + pfF — wcLn — wfLf 
C,F,L f ,Lc 

subject to the technological constraints described by the production possi¬ 
bilities set. 

Let us suppose that the firm finds it optimal in equilibrium to hire L* F 
units of Friday’s labor and L* c units of Crusoe’s labor. The question we 
want to focus on here is how profit maximization determines the pattern 
of output to produce. Let L* = u>cL* c + wfL* f represent the labor costs 
of production, and write the profits of the firm, 7r, as 

7T = pcC + p P F — L*. 

Rearranging this equation, we have 

C - _ + L * - PfF 
PC PC 

This equation describes the isoprofit lines of the firm, as depicted in 
Figure 32.10, with a slope of ~pf/pc and a vertical intercept of (7 t+L*) /pc- 
Since L* is fixed by assumption, higher profits will be associated with 
isoprofit lines that have higher vertical intercepts. 

If the firm wants to maximize its profits, it will choose a point on the 
production possibilities set such that the isoprofit line through that point 
has the highest possible vertical intercept. By this stage, it should be clear 
that this implies that the isoprofit line must be tangent to the production 
possibilities set; that is, that the slope of the production possibilities set 
(the MRT) should be equal to the slope of the isoprofit line, —pf/pc'- 

MRT = - — . 

PC 

We’ve described this profit-maximization problem in the case of one firm, 
but it holds for an arbitrary number of firms: each firm that chooses the 
most profitable way to produce coconuts and fish will operate where the 
marginal rate of transformation between any two goods equals the price 
ratio between those two goods. This holds true even if the firms have quite 
different production possibilities sets, as long as they face the same prices 
for the two goods. 

This means that in equilibrium the prices of the two goods will measure 
the marginal rate of transformation—the opportunity cost of one good in 
terms of the other. If you want more coconuts, you will have to give up 
some fish. How much fish? Just look at the price ratio of fish to coconuts: 
the ratio of these economic variables tells us what the technological tradeoff 
must be. 
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Figure 

32.10 



Profit maximization. At the point yielding maximum profits, 
the marginal rate of transformation must equal the slope of the 
isoprofit line, ~pf/pc- 


32.13 Robinson and Friday as Consumers 

We’ve seen how Castaways, Inc., determines its profit-maximizing produc¬ 
tion plan. In order to do this, it must hire some labor and it may generate 
some profits. When it hires labor, it pays wages to the labor; when it makes 
profits, it pays dividends to its shareholders. Either way the money made 
by Castaways, Inc., gets paid back to Robinson and Friday, either in the 
form of wages or profits. 

Since the firm pays out all of its receipts to its workers and its sharehold¬ 
ers, this means that they must necessarily have enough income to purchase 
its output. This is just a variation on Walras’ law discussed in Chapter 31: 
people get their income from selling their endowments, so they must always 
have enough income to purchase those endowments. Here people get in¬ 
come from selling their endowments and also receive profits from the firm. 
But since money never disappears from or is added to the system, people 
will always have exactly enough money to purchase what is produced. 

What do the consumers do with the money from the firm? As usual, 
they use the money to purchase consumption goods. Each person chooses 
the best bundle of goods that he can afford at the prices pf and pc ■ As 







DECENTRALIZED RESOURCE ALLOCATION 627 


we’ve seen earlier, the optimal consumption bundle for each consumer must 
satisfy the condition that the marginal rate of substitution between the two 
goods must be equal to the common price ratio. But this price ratio is also 
equal to the marginal rate of transformation, due to profit maximization 
by the firm. Thus the necessary conditions for Pareto efficiency are met: 
the MRS of each consumer equals the MRT. 

In this economy, the prices of the goods serve as a signal of relative 
scarcity. They indicate the technological scarcity—how much the produc¬ 
tion of one good must be reduced in order to produce more of the other; 
and they indicate the consumption scarcity—how much people are willing 
to reduce their consumption of one good in order to acquire some of the 
other good. 


32.14 Decentralized Resource Allocation 

The Crusoe-Friday economy is a drastically simplified picture. In order 
to make a start on a larger model of the functioning of an economy, one 
needs to use substantially more elaborate mathematics. However, even this 
simple model contains some useful insights. 

The most important of these is the relationship between individuals’ 
private goals of utility maximization and the social goals of efficient use 
of resources. Under certain conditions, the individuals’ pursuit of private 
goals will result in an allocation that is Pareto efficient overall. Further¬ 
more, any Pareto efficient allocation can be supported as an outcome of 
a competitive market, if initial endowments—including the ownership of 
firms—can be suitably redistributed. 

The great virtue of a competitive market is that each individual and each 
firm only has to worry about its own maximization problem. The only facts 
that need to be communicated among the firms and the consumers are the 
prices of the goods. Given these signals of relative scarcity, consumers and 
firms have enough information to make decisions that achieve an efficient 
allocation of resources. In this sense, the social problems involved in effi¬ 
ciently utilizing resources can be decentralized, and solved at the individual 
level. 

Each individual can solve his or her own problem of what to consume. 
The firms face the prices of the goods the consumers consume and decide 
how much to produce of each of them. In making this decision, they are 
guided by profit signals. In this context, profits serve as exactly the right 
guide. To say that a production plan is profitable is to say that people 
are willing to pay more for some good than it costs to produce it—so it is 
natural to expand the production of such goods. If all firms pursue a com¬ 
petitive profit-maximizing policy, and all consumers choose consumption 
bundles to maximize their own utility, then resulting competitive equilib¬ 
rium must be a Pareto efficient allocation. 
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Summary 

1. The general equilibrium framework can be extended by allowing com¬ 
petitive, profit-maximizing firms to produce goods destined for exchange 
in the economy. 

2. Under certain conditions there exists a set of prices for all of the input 
and output goods in the economy such that the profit-maximizing actions 
of firms along with the utility-maximizing behavior of individuals results 
in the demand for each good equaling the supply in all markets—that is, a 
competitive equilibrium exists. 

3. Under certain conditions the resulting competitive equilibrium will be 
Pareto efficient: the First Welfare Theorem holds in an economy with pro¬ 
duction. 

4. With the addition of convex production sets, the Second Welfare Theo¬ 
rem also holds in the case of production. 

5. When goods are being produced as efficiently as possible, the marginal 
rate of transformation between two goods indicates the number of units of 
one good the economy must give up to obtain additional units of the other 
good. 

6. Pareto efficiency requires that each individual’s marginal rate of substi¬ 
tution be equal to the marginal rate of transformation. 

7. The virtue of competitive markets is that they provide a way to achieve 
an efficient allocation of resources by decentralizing production and con¬ 
sumption decisions. 


REVIEW QUESTIONS 


1. The competitive price of coconuts is $6 per pound and the price of fish 
is $3 per pound. If society were to give up 1 pound of coconuts, how many 
more pounds of fish could be produced? 

2. What would happen if the firm depicted in Figure 32.2 decided to pay a 
higher wage? 

3. In what sense is a competitive equilibrium a good or bad thing for a 
given economy? 
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4. If Robinson’s marginal rate of substitution between coconuts and fish is 
—2 and the marginal rate of transformation between the two goods is —1, 
what should he do if he wants to increase his utility? 

5. Suppose that Robinson and Friday both want 60 pounds of fish and 
60 pounds of coconuts per day. Using the production rates given in the 
chapter, how many hours must Robinson and Friday work per day if they 
don’t help each other? Suppose they decide to work together in the most 
efficient manner possible. Now how many hours each day do they have to 
work? What is the economic explanation for the reduction in hours? 


APPENDIX 


Let us derive the calculus conditions for Pareto efficiency in an economy with 
production. We let X 1 and X 2 represent the total amount of good 1 and good 2 
produced and consumed, as in the body of this chapter: 

v-l 1.1 

X — Xa + x B 

,.2 2.2 

X — Xa + x B . 


The first thing we need is a convenient way to describe the production pos¬ 
sibilities frontier—all the combinations of X 1 and X 2 that are technologically 
feasible. The most useful way to do this for our purposes is by use of the trans¬ 
formation function. This is a function of the aggregate amounts of the two 
goods T(X*,X 2 ), such that the combination (A'^X 2 ) is on the production pos¬ 
sibilities frontier (the boundary of the production possibilities set) if and only 
if 


'/'(A' 1 . A' 2 ) = 0. 


Once we have described the technology, we can calculate the marginal rate 
of transformation: the rate at which we have to sacrifice good 2 in order to 
produce more of good 1. Although the name evokes an image of one good being 
“transformed” into another, that is a somewhat misleading picture. What really 
happens is that other resources are moved from producing good 2 to producing 
good 1. Thus, by devoting fewer resources to good 2 and more to good 1, we 
move from one point on the production possibilities frontier to another. The 
marginal rate of transformation is just the slope of the production possibilities 
set, which we denote by dX 2 /dX 1 . 

Consider a small change in production (dX 1 , dX 2 ) that remains feasible. Thus 
we have 


d nx\x 2 ) dXl + dnx\x 2 ) dx2 = Q 

dX 1 dX 2 


Solving for the marginal rate of transformation: 


dX 2 _ dT/dX 1 
dX 1 - “ dT/dX 2 ’ 
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We’ll use this formula in a moment. 

A Pareto efficient allocation is one that maximizes any one person’s utility, 
given the level of the other people’s utility. In the two-person case, we can write 
this maximization problem as 


max 

x 1 x‘^ X ^ X ^ 
^ A 4 R F 


such that 


ua(xa,Xa) 

u b (xb,x 2 b ) = U 
T(X 1 ,X 2 ) = 0. 


The Lagrangian for this problem is 

L = u a {x 1 a ,x 2 a ) - X(ub(xb,x 2 b ) - u) 
-li(T(X 1 ,X a )-0), 


and the first-order conditions are 

dL du A dT 

- T = - i - M - T = 0 

dx\ dx\ dX 1 

dL _ duA _ dT 
dx 2 1 dx 2 A ^ dX 2 

Jh£ L __ A 9us_ dT 
8 xb dxB ^ dX 1 

dL _ _ dug _ dT 
dx 2 B dx% ^ dX 2 

Rearranging and dividing the first equation by the second gives 

du A /dx\ _ dT/dX 1 
du A /dx\ ~ dT/dX 2 ' 

Performing the same operation on the third and fourth equations gives 

dua/dxl _ dT/dX 1 
dua/dxg dT/dX 2 ' 

The left-hand sides of these equations are our old friends, the marginal rates 
of substitution. The right-hand side is the marginal rate of transformation. Thus 
the equations require that each person’s marginal rate of substitution between 
the goods must equal the marginal rate of transformation: the rate at which each 
person is just willing to substitute one good for the other must be the same as the 
rate at which it is technologically feasible to transform one good into the other. 

The intuition behind this result is straightforward. Suppose that the MRS for 
some individual was not equal to the MRT. Then the rate at which the individual 
would be willing to sacrifice one good to get more of the other would be different 
than the rate that was technologically feasible—but this means that there would 
be some way to increase that individual’s utility while not affecting anyone else’s 
consumption. 


0 

0 . 








CHAPTER 



WELFARE 


Up until now we have focused on considerations of Pareto efficiency in eval¬ 
uating economic allocations. But there are other important considerations. 
It must be remembered that Pareto efficiency has nothing to say about the 
distribution of welfare across people; giving everything to one person will 
typically be Pareto efficient. But the rest of us might not consider this a 
reasonable allocation. In this chapter we will investigate some techniques 
that can be used to formalize ideas related to the distribution of welfare. 

Pareto efficiency is in itself a desirable goal—if there is some way to make 
some group of people better off without hurting other people, why not do 
it? But there will usually be many Pareto efficient allocations; how can 
society choose among them? 

The major focus of this chapter will be the idea of a welfare function, 
which provides a way to “add together” different consumers’ utilities. More 
generally, a welfare function provides a way to rank different distributions 
of utility among consumers. Before we investigate the implications of this 
concept, it is worthwhile considering just how one might go about “adding 
together” the individual consumers’ preferences to construct some kind of 
“social preferences.” 
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33.1 Aggregation of Preferences 

Let us return to our early discussion of consumer preferences. As usual, we 
will assume that these preferences are transitive. Originally, we thought 
of a consumer’s preferences as being defined over his own bundle of goods, 
but now we want to expand on that concept and think of each consumer as 
having preferences over the entire allocation of goods among the consumers. 
Of course, this includes the possibility that the consumer might not care 
about what other people have, just as we had originally assumed. 

Let us use the symbol x to denote a particular allocation—a description 
of what every individual gets of every good. Then given two allocations, x 
and y, each individual i can say whether or not he or she prefers x to y. 

Given the preferences of all the agents, we would like to have a way to 
“aggregate” them into one social preference. That is, if we know how 
all the individuals rank various allocations, we would like to be able to 
use this information to develop a social ranking of the various allocations. 
This is the problem of social decision making at its most general level. Let’s 
consider a few examples. 

One way to aggregate individual preferences is to use some kind of voting. 
We could agree that x is “socially preferred” to y if a majority of the 
individuals prefer x to y. However, there is a problem with this method 
it may not generate a transitive social preference ordering. Consider, for 
example, the case illustrated in Table 33.1. 


Table Preferences that lead to intransitive voting. 

33.1 


Person A 

Person B 

Person C 

X 

y 

z 

y 

Z 

X 

Z 

X 

y 


Here we have listed the rankings for three alternatives, x, y, and z, by 
three people. Note that a majority of the people prefer x to y, a majority 
prefer y to z, and a majority prefer z to x. Thus aggregating individual 
preferences by majority vote won’t work since, in general, the social prefer¬ 
ences resulting from majority voting aren’t well-behaved preferences, since 
they are not transitive. Since the preferences aren’t transitive, there will be 
no “best” alternative from the set of alternatives (x,y,z). Which outcome 
society chooses will depend on the order in which the vote is taken. 
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To see this suppose that the three people depicted in Table 33.1 decide to 
vote first on x versus y, and then vote on the winner of this contest versus 
z. Since a majority prefer x to y, the second contest will be between x and 
z, which means that z will be the outcome. 

But what if they decide to vote on z versus x and then pit the winner of 
this vote against y? Now z wins the first vote, but y beats z in the second 
vote. Which outcome is the overall winner depends crucially on the order 
in which the alternatives are presented to the voters. 

Another kind of voting mechanism that we might consider is rank-order 
voting. Here each person ranks the goods according to his preferences and 
assigns a number that indicates its rank in his ordering: for example, a 1 
for the best alternative, 2 for the second best, and so on. Then we sum up 
the scores of each alternative across the people to determine an aggregate 
score for each alternative and say that one outcome is socially preferred to 
another if it has a lower score. 

In Table 33.2 we have illustrated a possible preference ordering for three 
allocations x, y, and z by two people. Suppose first that only alternatives 
x and y were available. Then in this example x would be given a rank of 1 
by person A and 2 by person B. The alternative y would be given just the 
reverse ranking. Thus the outcome of the voting would be a tie with each 
alternative having an aggregate rank of 3. 


The choice between x and y depends on z. Table 

33.2 


Person A 

Person B 

X 

y 

y 

Z 

Z 

X 


But now suppose that z is introduced to the ballot. Person A would give 
x a score of 1, y a score of 2, and z a rank of 3. Person B would give y a 
score of 1, z a score of 2, and x a score of 3. This means that x would now 
have an aggregate rank of 4, and y would have an aggregate rank of 3. In 
this case y would be preferred to x by rank-order voting. 

The problem with both majority voting and rank-order voting is that 
their outcomes can be manipulated by astute agents. Majority voting can 
be manipulated by changing the order on which things are voted so as 
to yield the desired outcome. Rank-order voting can be manipulated by 
introducing new alternatives that change the final ranks of the relevant 
alternatives. 





634 WELFARE (Ch. 33) 


The question naturally arises as to whether there are social decision 
mechanisms—ways of aggregating preferences—that are immune to this 
kind of manipulation? Are there ways to “add up” preferences that don’t 
have the undesirable properties described above? 

Let’s list some things that we would want our social decision mechanism 
to do: 

1. Given any set of complete, reflexive, and transitive individual prefer¬ 
ences, the social decision mechanism should result in social preferences 
that satisfy the same properties. 

2. If everybody prefers alternative x to alternative y, then the social pref¬ 
erences should rank x ahead of y. 

3. The preferences between x and y should depend only on how people rank 
x versus y, and not on how they rank other alternatives. 

All three of these requirements seem eminently plausible. Yet it can 
be quite difficult to find a mechanism that satisfies all of them. In fact, 
Kenneth Arrow has proved the following remarkable result: 1 

Arrow’s Impossibility Theorem. If a social decision mechanism satis¬ 
fies properties 1, 2, and 3, then it must be a dictatorship: all social rankings 
are the rankings of one individual. 

Arrow’s Impossibility Theorem is quite surprising. It shows that three 
very plausible and desirable features of a social decision mechanism are 
inconsistent with democracy: there is no “perfect” way to make social 
decisions. There is no perfect way to “aggregate” individual preferences to 
make one social preference. If we want to find a way to aggregate individual 
preferences to form social preferences, we will have to give up one of the 
properties of a social decision mechanism described in Arrow’s theorem. 


33.2 Social Welfare Functions 

If we were to drop any of the desired features of a social welfare function 
described above, it would probably be property 3—that the social prefer¬ 
ences between two alternatives only depends on the ranking of those two 
alternatives. If we do that, certain kinds of rank-order voting become pos¬ 
sibilities. 


1 See Kenneth Arrow, Social Choice and Individual Values (New York: Wiley, 1963). 
Arrow, a professor at Stanford University, was awarded the Nobel Prize in economics 
for his work in this area. 
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Given the preferences of each individual i over the allocations, we can 
construct utility functions, ztj(x), that summarize the individuals’ value 
judgments: person i prefers x to y if and only if 14(x) > 14(y). Of course, 
these are just like all utility functions—they can be scaled in any way that 
preserves the underlying preference ordering. There is no unique utility 
representation. 

But let us pick some utility representation and stick with it. Then one 
way of getting social preferences from individuals’ preferences is to add up 
the individual utilities and use the resulting number as a kind of social util¬ 
ity. That is, we will say that allocation x is socially preferred to allocation 

y if 

n n 

> E u *(y)’ 

i= 1 i— 1 

where n is the number of individuals in the society. 

This works—but of course it is totally arbitrary, since our choice of utility 
representation is totally arbitrary. The choice of using the sum is also 
arbitrary. Why not use a weighted sum of utilities? Why not use the 
product of utilities, or the sum of the squares of utilities? 

One reasonable restriction that we might place on the “aggregating func¬ 
tion” is that it be increasing in each individual’s utility. That way we are 
assured that if everybody prefers x to y, then the social preferences will 
prefer x to y. 

There is a name for this kind of aggregating function; it is called a social 
welfare function. A social welfare function is just some function of the 
individual utility functions: W(zti(x),..., u„(x)). It gives a way to rank 
different allocations that depends only on the individual preferences, and 
it is an increasing function of each individual’s utility. 

Let’s look at some examples. One special case mentioned above is the 
sum of the individual utility functions 


n 

W(u 1: ...,«„) = E m »- 

i=1 

This is sometimes referred to as a classical utilitarian or Benthamite 
welfare function. 2 A slight generalization of this form is the weighted- 
sum-of-utilities welfare function: 


n 

W(ui, ...,u n ) = Y^. 

2=1 


2 Jeremy Bentham (1748-1832) was the founder of the utilitarian school of moral phi¬ 
losophy, a school that considers the highest good to be the greatest happiness for the 
greatest number. 
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Here the weights, ai,..., a n , are supposed to be numbers indicating how 
important each agent’s utility is to the overall social welfare. It is natural 
to take each ai as being positive. 

Another interesting welfare function is the minimax or Rawlsian social 
welfare function: 


W(ui, ...,«„)= min{ui,... ,u n }. 


This welfare function says that the social welfare of an allocation depends 
only on the welfare of the worst off agent—the person with the minimal 
utility. 3 

Each of these is a possible way to compare individual utility functions. 
Each of them represents different ethical judgments about the comparison 
between different agents’ welfares. About the only restriction that we will 
place on the structure of the welfare function at this point is that it be 
increasing in each consumer’s utility. 

33.3 Welfare Maximization 

Once we have a welfare function we can examine the problem of welfare 
maximization. Let us use the notation xj to indicate how much individual 
i has of good j, and suppose that there are n consumers and k goods. Then 
the allocation x consists of the list of how much each of the agents has of 
each of the goods. 

If we have a total amount X 1 , ..., X k of goods 1 ,k to distribute 
among the consumers, we can pose the welfare maximization problem: 


max W(iti(x),..., u„(x)) 


n 



n 



Thus we are trying to find the feasible allocation that maximizes social 
welfare. What properties does such an allocation have? 

The first thing that we should note is that a maximal welfare allocation 
must be a Pareto efficient allocation. The proof is easy: suppose that 


3 


John Rawls is a contemporary moral philosopher at Harvard who has argued for this 
principle of justice. 
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it were not. Then there would be some other feasible allocation that gave 
everyone at least as large a utility, and someone strictly greater utility. But 
the welfare function is an increasing function of each agent’s utility. Thus 
this new allocation would have to have higher welfare, which contradicts 
the assumption that we originally had a welfare maximum. 

We can illustrate this situation in Figure 33.1, where the set U indicates 
the set of possible utilities in the case of two individuals. This set is known 
as the utility possibilities set. The boundary of this set—the utility 
possibilities frontier —is the set of utility levels associated with Pareto 
efficient allocations. If an allocation is on the boundary of the utility pos¬ 
sibilities set, then there are no other feasible allocations that yield higher 
utilities for both agents. 



Welfare maximization. An allocation that maximizes a wel¬ 
fare function must be Pareto efficient. 


The “indifference curves” in this diagram are called isowelfare curves 
since they depict those distributions of utility that have constant welfare. 
As usual, the optimal point is characterized by a tangency condition. But 
for our purposes, the notable thing about this maximal welfare point is 
that it is Pareto efficient—it must occur on the boundary of the utility 
possibilities set. 

The next observation we can make from this diagram is that any Pareto 
efficient allocation must be a welfare maximum for some welfare function. 


Figure 

33.1 
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Figure 

33.2 



Maximization of the weighted-sum-of-utilities welfare 
function. If the utility possibility set is convex, then ev¬ 
ery Pareto efficient point is a maximum for a weighted-sum- 
of-utilities welfare function. 


An example is given in Figure 33.2. 

In Figure 33.2 we have picked a Pareto efficient allocation and found a 
set of isowelfare curves for which it yields maximal welfare. Actually, we 
can say a bit more than this. If the set of possible utility distributions is 
a convex set, as illustrated, then every point on its frontier is a welfare 
maximum for a weighted-sum-of-utilities welfare function, as illustrated in 
Figure 33.2. The welfare function thus provides a way to single out Pareto 
efficient allocations: every welfare maximum is a Pareto efficient allocation, 
and every Pareto efficient allocation is a welfare maximum. 


33.4 Individualistic Social Welfare Functions 

Up until now we have been thinking of individual preferences as being 
defined over entire allocations rather than over each individual’s bundle 
of goods. But, as we remarked earlier, individuals might only care about 
their own bundles. In this case, we can use Xi to denote individual i's 
consumption bundle, and let Ui(xi) be individual i's utility level using 
some fixed representation of utility. Then a social welfare function will 
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have the form 

W = W Oi(a;i),... ,u n (x n )). 

The welfare function is directly a function of the individuals’ utility levels, 
but it is indirectly a function of the individual agents’ consumption bun¬ 
dles. This special form of welfare function is known as an individualistic 
welfare function or a Bergson-Samuelson welfare function. 4 

If each agent’s utility depends only on his or her own consumption, then 
there are no consumption externalities. Thus the standard results of Chap¬ 
ter 31 apply and we have an intimate relationship between Pareto efficient 
allocations and market equilibria: all competitive equilibria are Pareto ef¬ 
ficient, and, under appropriate convexity assumptions, all Pareto efficient 
allocations are competitive equilibria. 

Now we can carry this categorization one step further. Given the rela¬ 
tionship between Pareto efficiency and welfare maxima described above, we 
can conclude that all welfare maxima are competitive equilibria and that 
all competitive equilibria are welfare maxima for some welfare function. 


33.5 Fair Allocations 

The welfare function approach is a very general way to describe social 
welfare. But because it is so general it can be used to summarize the 
properties of many kinds of moral judgments. On the other hand, it isn’t 
much use in deciding what kinds of ethical judgments might be reasonable 
ones. 

Another approach is to start with some specific moral judgments and 
then examine their implications for economic distribution. This is the 
approach taken in the study of fair allocations. We start with a definition 
of what might be considered a fair way to divide a bundle of goods, and then 
use our understanding of economic analysis to investigate its implications. 

Suppose that you were given some goods to divide fairly among n equally 
deserving people. How would you do it? It is probably safe to say that 
in this problem most people would divide the goods equally among the n 
agents. Given that they are by hypothesis equally deserving, what else 
could you do? 

What is appealing about this idea of equal division? One appealing 
feature is that it is symmetric. Each agent has the same bundle of goods; 
no agent prefers any other agent’s bundle of goods to his or her own, since 
they all have exactly the same thing. 

Unfortunately, an equal division will not necessarily be Pareto efficient. 
If agents have different tastes they will generally desire to trade away from 

4 Abram Bergson and Paul Samuelson are contemporary economists who investigated 
properties of this kind of welfare function in the early 1940s. Samuelson was awarded 
a Nobel Prize in economics for his many contributions. 
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equal division. Let us suppose that this trade takes place and that it moves 
us to a Pareto efficient allocation. 

The question arises: is this Pareto efficient allocation still fair in any 
sense? Does trade from equal division inherit any of the symmetry of the 
starting point? 

The answer is: not necessarily. Consider the following example. We 
have three people, A, B, and C. A and B have the same tastes, and C has 
different tastes. We start from an equal division and suppose that A and 
C get together and trade. Then they will typically both be made better 
off. Now B, who didn’t have the opportunity to trade with C, will envy 
A -that is, he would prefer A’s bundle to his own. Even though A and 
B started with the same allocation, A was luckier in her trading, and this 
destroyed the symmetry of the original allocation. 

This means that arbitrary trading from an equal division will not nec¬ 
essarily preserve the symmetry of the starting point of equal division. We 
might well ask if there is any allocation that preserves this symmetry? Is 
there any way to get an allocation that is both Pareto efficient and equitable 
at the same time? 


33.6 Envy and Equity 

Let us now try to formalize some of these ideas. What do we mean by 
“symmetric” or “equitable” anyway? One possible set of definitions is as 
follows. 

We say an allocation is equitable if no agent prefers any other agent’s 
bundle of goods to his or her own. If some agent i does prefer some other 
agent j's bundle of goods, we say that i envies j. Finally, if an allocation 
is both equitable and Pareto efficient, we will say that it is a fair allocation. 

These are ways of formalizing the idea of symmetry alluded to above. An 
equal division allocation has the property that no agent envies any other 
agent— but there are many other allocations that have this same property. 

Consider Figure 33.3. To determine whether any allocation is equitable 
or not, just look at the allocation that results if the two agents swap bun¬ 
dles. If this swapped allocation lies “below” each agent’s indifference curve 
through the original allocation, then the original allocation is an equitable 
allocation. (Here “below” means below from the point of view of each 
agent; from our point of view the swapped allocation must lie between the 
two indifference curves.) 

Note also that the allocation in Figure 33.3 is also Pareto efficient. Thus 
it is not only equitable, in the sense that we defined the term, but it is also 
efficient. By our definition, it is a fair allocation. Is this kind of allocation 
a fluke, or will fair allocations typically exist? 

It turns out that fair allocations will generally exist, and there is an easy 
way to see that this is so. We start as we did in the last section, where 
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Figure 

33.3 


we had an equal division allocation and considered trading to a Pareto 
efficient allocation. Instead of using just any old way to trade, let us use 
the special mechanism of the competitive market. This will move us to a 
new allocation where each agent is choosing the best bundle of goods he or 
she can afford at the equilibrium prices (pi,p 2 ), and we know from Chapter 
31 that such an allocation must be Pareto efficient. 

But is it still equitable? Well, suppose not. Suppose that one of the 
consumers, say consumer A, envies consumer B. This means that A prefers 
what B has to her own bundle. In symbols: 

(xa,Xa) 

But, if A prefers B’s bundle to her own, and if her own bundle is the 
best bundle she can afford at the prices ( Pi,P 2 ), this means that B’s bundle 
must cost more than A can afford. In symbols: 

PlUJA +P2UJA < PlXB +P2X 2 B - 

But this is a contradiction! For by hypothesis, A and B started with 
exactly the same bundle, since they started from an equal division. If A 
can’t afford B’s bundle, then B can’t afford it either. 

Thus we can conclude that it is impossible for A to envy B in these 
circumstances. A competitive equilibrium from equal division must be a 
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fair allocation. Thus the market mechanism will preserve certain kinds of 
equity: if the original allocation is equally divided, the final allocation must 
be fair. 


Summary 

1. Arrow’s Impossibility Theorem shows that there is no ideal way to ag¬ 
gregate individual preferences into social preferences. 

2. Nevertheless, economists often use welfare functions of one sort or an¬ 
other to represent distributional judgments about allocations. 

3. As long as the welfare function is increasing in each individual’s utility, 
a welfare maximum will be Pareto efficient. Furthermore, every Pareto 
efficient allocation can be thought of as maximizing some welfare function. 

4. The idea of fair allocations provides an alternative way to make distribu¬ 
tional judgments. This idea emphasizes the idea of symmetric treatment. 

5. Even when the initial allocation is symmetric, arbitrary methods of trade 
will not necessarily produce a fair allocation. However, it turns out that 
the market mechanism will provide a fair allocation. 


REVIEW QUESTIONS 


1. Suppose that we say that an allocation x is socially preferred to an 
allocation y only if everyone prefers x to y. (This is sometimes called the 
Pareto ordering, since it is closely related to the idea of Pareto efficiency.) 
What shortcoming does this have as a rule for making social decisions? 

2. A Rawlsian welfare function counts only the welfare of the worst off 
agent. The opposite of the Rawlsian welfare function might be called the 
“Nietzschean” welfare function—a welfare function that says the value of 
an allocation depends only on the welfare of the best off agent. What 
mathematical form would the Nietzschean welfare function take? 

3. Suppose that the utility possibilities set is a convex set and that con¬ 
sumers care only about their own consumption. What kind of allocations 
represent welfare maxima of the Nietzschean welfare function? 

4. Suppose that an allocation is Pareto efficient, and that each individual 
only cares about his own consumption. Prove that there must be some 
individual that envies no one, in the sense described in the text. (This 
puzzle requires some thought, but it is worth it.) 
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5. The ability to set the voting agenda can often be a powerful asset. 
Assuming that social preferences are decided by pair-wise majority voting 
and that the preferences given in Table 30.1 hold, demonstrate this fact by 
producing a voting agenda that results in allocation y winning. Find an 
agenda that has z as the winner. What property of the social preferences 
is responsible for this agenda-setting power? 


APPENDIX 

Here we consider the problem of welfare maximization, using an individualistic 
welfare function. Using the transformation function described in Chapter 32 to 
describe the production possibilities frontier, we write the welfare maximization 
problem as 

max W(u A (x\,x 2 A ),u B {x 1 B ,x 2 B )) 

X A’ X A' X B’ X B 

such that T(X 1 ,X 2 ) = 0, 

where we use X 1 and X 2 to denote the total amount of good 1 and good 2 
produced and consumed. 

The Lagrangian for this problem is 

L = - A (T(X\X 2 ) - 0). 

Differentiating with respect to each of the choice variables gives us the first- 
order conditions 

dL _ ow du A {x\,x 2 A ) } dT{x\x 2 ) 

dx\ 9 u a Qx\ dX 1 

dL = OW du A {x\,x 2 A ) _ dT{X\X 2 ) = 
dx 2 A du A dx A dX 2 

dL = OW du B {x l B ,x |) } dT{X\ X 2 ) _ 
dx B du B dx B dX 1 

dL = dW 9ub(jb,sI) _ dT{X\X 2 ) 

dx 2 B du B Qx 2 B dX 2 

Rearranging and dividing the first equation by the second, and the third by 
the fourth, we have 

du A /dx\ dT/dX 1 
du A /dx\ = dT/dX 2 
dus/dx]} _ dT/dX 1 
du B /dx 2 B ~ dT/dX 2 ' 

Note that these are exactly the same equations that we encountered in the Ap¬ 
pendix to Chapter 32. Thus the welfare maximization problem gives us the same 
first-order conditions as the Pareto efficiency problem. 

This is obviously no accident. According to the discussion in the text, the allo¬ 
cation resulting from the maximization of a Bergson-Samuelson welfare function 
is Pareto efficient, and every Pareto efficient allocation maximizes some welfare 
function. Thus welfare maxima and Pareto efficient allocations have to satisfy 
the same first-order conditions. 















CHAPTER 



EXTERNALITIES 


We say that an economic situation involves a consumption externality if 
one consumer cares directly about another agent’s production or consump¬ 
tion. For example, I have definite preferences about my neighbor playing 
loud music at 3 in the morning, or the person next to me in a restaurant 
smoking a cheap cigar, or the amount of pollution produced by local auto¬ 
mobiles. These are all examples of negative consumption externalities. On 
the other hand, I may get pleasure from observing my neighbor’s flower 
garden—this is an example of a positive consumption externality. 

Similarly, a production externality arises when the production pos¬ 
sibilities of one firm are influenced by the choices of another firm or con¬ 
sumer. A classic example is that of an apple orchard located next to a 
beekeeper, where there are mutual positive production externalities—each 
firm’s production positively affects the production possibilities of the other 
firm. Similarly, a fishery cares about the amount of pollutants dumped 
into its fishing area, since this will negatively influence its catch. 

The crucial feature of externalities is that there are goods people care 
about that are not sold on markets. There is no market for loud music at 3 
in the morning, or drifting smoke from cheap cigars, or a neighbor who 
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keeps a beautiful flower garden. It is this lack of markets for externalities 
that causes problems. 

Up until now we have implicitly assumed that each agent could make 
consumption or production decisions without worrying about what other 
agents were doing. All interactions between consumers and producers took 
place via the market, so that all the economic agents needed to know were 
the market prices and their own consumption or production possibilities. 
In this chapter we will relax this assumption and examine the economic 
consequences of externalities. 

In earlier chapters we saw that the market mechanism was capable of 
achieving Pareto efficient allocations when externalities were not present. 
If externalities are present, the market will not necessarily result in a Pareto 
efficient provision of resources. However, there are other social institutions 
such as the legal system, or government intervention, that can “mimic” the 
market mechanism to some degree and thereby achieve Pareto efficiency. 
In this chapter we’ll see how these institutions work. 


34.1 Smokers and Nonsmokers 

It is convenient to start with an example to illustrate some of the main 
considerations. We’ll imagine two roommates, A and B, who have prefer¬ 
ences over “money” and “smoke.” We suppose that both consumers like 
money, but that A likes to smoke and B likes clean air. 

We can depict the consumption possibilities for the two consumers in 
an Edgeworth box. The length of the horizontal axis will represent the 
total amount of money the two agents have, and the height of the vertical 
axis will represent the total amount of smoke that can be generated. The 
preferences of agent A are increasing in both money and smoke, while 
agent B’s preferences are increasing in money and clean air—the absence 
of smoke. We’ll measure smoke on a scale from 0 to 1, where 0 is no smoke 
at all, and 1 is the proverbial smoke-filled room. 

This setup gives us a diagram like that depicted in Figure 34.1. Note 
that the picture looks very much like the standard Edgeworth box, but the 
interpretation is quite different. The amount of smoke is a good for A and 
a bad for B, so that B is moved to a more preferred position as A consumes 
less smoke. Be sure to note the difference in the way things are measured 
on the horizontal and vertical axes. We measure A’s money horizontally 
from the lower left-hand corner of the box, and B’s money horizontally from 
the upper right-hand corner. But the total amount of smoke is measured 
vertically from the lower left-hand corner. The difference occurs because 
money can be divided between the two consumers, so there will always be 
two amounts of money to measure, but there is only one amount of smoke 
that they must both consume. 
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In the ordinary Edgeworth box diagram B is made better off when A 
reduces his consumption of good 2- but that is because B then gets to 
consume more of good 2. In the Edgeworth box in Figure 34.1 B is also 
better off when A reduces his consumption of good 2 (smoke), but for a 
very different reason. In this example, B is better off when A reduces his 
consumption of smoke since both agents must consume the same amount 
of smoke and smoke is a bad for agent B. 

We’ve now illustrated the consumption possibilities of the two roommates 
and their preferences. What about their endowments? Let’s assume that 
they both have the same amount of money, say $100 apiece, so that their 
endowments will lie somewhere on the vertical line in Figure 34.1. In 
order to determine exactly where on this line the endowments lie, we must 
determine the initial “endowment” of smoke/clean air. 


Figure 

34.1 



Preferences for money and smoke. Smoke is a good for 
person A but a bad for person B. Which equilibrium we end up 
at depends on which endowment we start at. 


The answer to this question depends on the legal rights of smokers and 
nonsmokers. It may be that A has a right to smoke as much as he wants, 
and B just has to put up with it. Or, it could be that B has a right to 
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clean air. Or the legal right to smoke and clean air could be somewhere 
between these two extremes. 

The initial endowment of smoke depends on the legal system. This is 
not so different from the initial endowment of ordinary sorts of goods. To 
say that A has an initial endowment of $100 means that A can decide to 
consume the $100 himself, or he can give it away or trade it to any other 
individual. There is a legal definition of property involved in saying that a 
person “owns” or “has a right to” $100. Similarly if a person has a property 
right to clean air, it means that he can consume clean air if he wants to, or 
he can give it away or sell that right to someone else. In this way, having 
a property right to clean air is no different from having a property right to 
$100. 

Let’s start by considering a legal situation where person B has a legal 
right to clean air. Then the initial endowment in Figure 34.1 is labeled E; 
it is where A has (100,0) and B has (100,0). This means that both A and 
B have $100, and that the initial endowment—what there would be in the 
absence of trade—is clean air. 

Just as before, in the case with no externalities, there is no reason why 
the initial endowment is Pareto efficient. One of the aspects of having a 
property right to clean air is having the right to trade some of it away for 
other desirable goods—in this case, for money. It can easily happen that B 
would prefer to trade some of his right to clean air for some more money. 
The point labeled X in Figure 34.1 is an example of such a case. 

As before, a Pareto efficient allocation is one where neither consumer 
can be made better off without the other being made worse off. Such an 
allocation will be characterized by the usual tangency condition that the 
marginal rates of substitution between smoke and money should be the 
same between the two agents, as illustrated in Figure 34.1. It is easy to 
imagine A and B trading to such a Pareto efficient point. In effect, B has 
the right to clean air, but he can allow himself to be “bribed” to consume 
some of A’s smoke. 

Of course, other assignments of property rights are possible. We could 
imagine a legal system where A had a right to smoke as much as he wanted, 
and B would have to bribe A to reduce his consumption of smoke. This 
would correspond to the endowment labeled E' in Figure 34.1. Just as 
before, this would typically not be Pareto efficient, so we could imagine the 
agents trading to a mutually preferred point such as the one labeled X'. 

Both X and X' are Pareto efficient allocations; they just come from 
different initial endowments. Certainly the smoker, A, is better off at X' 
than at X , and the nonsmoker, B, is better off at X than at X'. The 
two points have different distributional consequences, but on grounds of 
efficiency they are equally satisfactory. 

In fact, there is no reason to limit ourselves to just these two efficient 
points. As usual there will be a whole contract curve of Pareto efficient 
allocations of smoke and money. If agents are free to trade both of these 
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goods, we know that they will end up somewhere on this contract curve. 
The exact position will depend on their property rights involving smoke 
and money and on the precise mechanism that they use to trade. 

One mechanism that they could use to trade is the price mechanism. 
Just as before we could imagine an auctioneer calling out prices and asking 
how much each agent would be willing to buy at those prices. If the initial 
endowment point gave A the property rights to smoke, he could consider 
selling some of his smoking rights to B in exchange for B’s money. Similarly, 
if the property rights for clean air were given to B, he could sell some of 
his clean air to A. 

When the auctioneer manages to find a set of prices where supply equals 
demand everything is fine: we have a nice Pareto efficient outcome. If 
there is a market for smoke, a competitive equilibrium will be Pareto effi¬ 
cient. Furthermore, the competitive prices will measure the marginal rate 
of substitution between the two goods, just as in the standard case. 

This is just like the usual Edgeworth box analysis, but described in 
a slightly different framework. As long as we have well-defined property 
rights in the good involving the externality—no matter who holds the prop¬ 
erty rights—the agents can trade from their initial endowment to a Pareto 
efficient allocation. If we want to set up a market in the externality to 
encourage trade, that will work as well. 

The only problem arises if the property rights are not well defined. If A 
believes that he has the right to smoke and B believes that he has the right 
to clean air, we have difficulties. The practical problems with externalities 
generally arise because of poorly defined property rights. 

My neighbor may believe that he has the right to play his trumpet at 3 in 
the morning, and I may believe that I have the right to silence. A firm may 
believe that it has the right to dump pollutants into the atmosphere that I 
breathe, while I may believe that it doesn’t. Cases where property rights are 
poorly defined can lead to an inefficient production of externalities—which 
means that there would be ways to make both parties involved better off by 
changing the production of externalities. If property rights are well defined, 
and mechanisms are in place to allow for negotiation between people, then 
people can trade their rights to produce externalities in the same way that 
they trade rights to produce and consume ordinary goods. 


34.2 Quasilinear Preferences and the Coase Theorem 

We argued above that as long as property rights were well defined, trade 
between agents would result in an efficient allocation of the externality. In 
general, the amount of the externality that will be generated in the efficient 
solution will depend on the assignment of property rights. In the case of the 
two roommates, the amount of smoke generated will depend on whether 
the smoker has the property rights or the nonsmoker has them. 
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But there is a special case where the outcome of the externality is inde¬ 
pendent of the assignment of property rights. If the agents’ preferences are 
quasilinear, then every efficient solution must have the same amount of 
the externality. 

This case is illustrated in Figure 34.2 for the Edgeworth box case of 
the smoker versus the nonsmoker. Since the indifference curves are all 
horizontal translates of each other, the locus of mutual tangencies—the set 
of Pareto efficient allocations—will be a horizontal line. This means that 
the amount of smoke is the same in every Pareto efficient allocation; only 
the dollar amounts held by the agents differ across the efficient allocations. 


Person 



Quasilinear preferences and the Coase theorem. If each 
consumer’s preferences are quasilinear, so that they are all hor¬ 
izontal translates of each other, the set of Pareto efficient allo¬ 
cations will be a horizontal line. Thus there will be a unique 
amount of the externality, in this case smoke, at each Pareto 
efficient allocation. 


Figure 

34.2 


The result that under certain conditions the efficient amount of the good 
involved in the externality is independent of the distribution of property 
rights is sometimes known as the Coase Theorem. However, it should be 
emphasized just how special these conditions are. The quasilinear prefer¬ 
ence assumption implies that the demands for the good causing the exter- 
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nality doesn’t depend on the distribution of income. Therefore a realloca¬ 
tion of endowments doesn’t affect the efficient amount of the externalities. 
This is sometimes expressed by saying that the Coase theorem is valid if 
there are no “income effects.” 1 

In this case, the Pareto efficient allocations will involve a unique amount 
of the externality being generated. The different Pareto efficient allocations 
will involve different amounts of money being held by the consumers; but 
the amount of the externality—the amount of smoke—will be independent 
of the distribution of wealth. 


34.3 Production Externalities 

Let us now consider a situation involving production externalities. Firm 
S produces some amount of steel, s, and also produces a certain amount 
of pollution, x , which it dumps into a river. Firm F, a fishery, is located 
downstream and is adversely affected by S’s pollution. 

Suppose that firm S’s cost function is given by c s (s,a;), where s is the 
amount of steel produced and x is the amount of pollution produced. Firm 
F’s cost function is given by c/(/, x), where / indicates the production of 
fish and x is the amount of pollution. Note that F’s costs of producing a 
given amount of fish depend on the amount of pollution produced by the 
steel firm. We will suppose that pollution increases the cost of providing 
fish Acf/Ax > 0, and that pollution decreases the cost of steel production, 
Ac s /Ax < 0. This last assumption says that increasing the amount of 
pollution will decrease the cost of producing steel—that reducing pollution 
will increase the cost of steel production, at least over some range. 

The steel firm’s profit-maximization problem is 

max p s s — c s (s, x) 

S,X 

and the fishery’s profit-maximization problem is 

max p f f -c f {f,x). 

Note that the steel mill gets to choose the amount of pollution that it 
generates, but the fishery must take the level of pollution as outside of its 
control. 

1 Ronald Coase is an emeritus professor at the University of Chicago Law School. His 
famous paper, “The Problem of Social Costs,” The Journal of Law & Economics , 3 
(October 1960), has been given a variety of interpretations. Some authors suggest 
that Coase only asserted that costless bargaining over externalities achieves a Pareto 
efficient outcome, not that the outcome will be independent of the assignment of 
property rights. Coase received the 1991 Nobel Prize in Economics for this work. 
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The conditions characterizing profit maximization will be 


Ps = 
0 = 


A c s (s*,x*) 
As 

Ac s (s*,x*) 

Ax 


for the steel firm and 

„ * <=/(/•,**) 

P, ~ A / 

for the fishery. These conditions say that at the profit-maximizing point, 
the price of each good—steel and pollution—should equal its marginal cost. 
In the case of the steel firm, one of its products is pollution, which, by 
assumption, has a zero price. So the condition determining the profit- 
maximizing supply of pollution says to produce pollution until the cost of 
an extra unit is zero. 

It is not hard to see the externality here: the fishery cares about the 
production of pollution but has no control over it. The steel firm looks 
only at the cost of producing steel when it makes its profit-maximizing 
calculation; it doesn’t consider the cost it imposes on the fishery. The 
increase in the cost of fishing associated with an increase in pollution is 
part of the social cost of steel production, and it is being ignored by the 
steel firm. In general, we expect that the steel firm will produce too much 
pollution from a social point of view since it ignores the impact of that 
pollution on the fishery. 

What does a Pareto efficient production plan for steel and fish look like? 
There is an easy way to see what it should be. Suppose that the fishery 
and the steel firm merged and formed one firm that produced both fish 
and steel (and possibly pollution). Then there is no externality! For a 
production externality only arises when one firm’s actions affect another 
firm’s production possibilities. If there is only one firm, then it will take the 
interactions between its different “divisions” into account when it chooses 
the profit-maximizing production plan. We say that the externality has 
been internalized by this reassignment of property rights. Before the 
merger, each firm had the right to produce whatever amount of steel or fish 
or pollution that it wanted, regardless of what the other firm did. After 
the merger, the combined firm has the right to control the production of 
both the steel mill and the fishery. 

The merged firm’s profit-maximization problem is 


max PaS + pff - c s (s,x ) - c/(/, x), 

s,f,x 
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which yields optimality conditions of 

Ac s (s, x) 

PS = ^~ 

_ A c f (f,x) 

Pf A / 

= Ac s (a,x) A c f {f,x) 

Ax Ax 


The crucial term is the last one. This shows that the merged firm will 
take into account the effect of pollution on the marginal costs of both 
the steel firm and the fishery. When the steel division decides how much 
pollution to produce, it considers the effect of this action on the profits of 
the fish division; that is, it takes the social cost of its production plan into 
account. 

What does this imply about the amount of pollution produced? When 
the steel firm acted independently, the amount of pollution was determined 
by the condition 


Ac s (s*,x*) 

Ax 


= 0 . 


(34.1) 


That is, the steel mill produced pollution until the marginal cost was zero: 


MC s (s*,x*) = 0. 


In the merged firm, the amount of pollution is determined by the condition 


A c s (s,x) A c f {f,x) 

Ax Ax 


(34.2) 


That is, the merged firm produces pollution until the sum of the marginal 
cost to the steel mill and the marginal cost to the fishery is zero. This 
condition can also be written as 


Acs(s,x) 

Ax 


&c f (f,x) 

Ax 


(34.3) 


or 

-MC s (s,x) = MC F (f,x). 

In this latter expression MCf(/,i) is positive, since more pollution in¬ 
creases the cost of producing a given amount of fish. Hence the merged 
firm will want to produce where — MCs(s,x ) is positive; that is, it will 
want to produce less pollution than the independent steel firm. When the 
true social cost of the externality involved in the steel production is taken 
into account, the optimal production of pollution will be reduced. 

When the steel firm considers minimizing its private costs of producing 
steel, it produces where the marginal cost of extra pollution equals zero; 
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but the Pareto efficient level of pollution requires minimizing the social 
costs of the pollution. At the Pareto efficient level of pollution, the sum 
of the two firm’s marginal costs of pollution must be equal to zero. 

This argument is illustrated in Figure 34.3. In this diagram —MCs 
measures the marginal cost to the steel firm from producing more pollution. 
The curve labeled MCf measures the marginal cost to the fishery of more 
pollution. The profit-maximizing steel firm produces pollution up to the 
point where its marginal cost from generating more pollution equals zero. 



Social cost and private cost. The steel firm produces pollu¬ 
tion up to the point where the marginal cost of extra pollution 
equals zero. But the Pareto efficient production of pollution 
is at the point where price equals marginal social cost, which 
includes the cost of pollution borne by the fishery. 


Figure 

34.3 


But at the Pareto efficient level of pollution, the steel firm pollutes up to 
the point where the effect of a marginal increase in pollution is equal to the 
marginal social cost, which counts the impact of pollution on the costs of 
both firms. At the efficient level of pollution production, the amount that 
the steel firm is willing to pay for an extra unit of pollution should equal 
the social costs generated by that extra pollution—which include the costs 
it imposes on the fishery. 

This is perfectly consistent with the efficiency arguments given in earlier 
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chapters. There we assumed that there were no externalities, so that private 
costs and social costs coincided. In this case the free market will determine 
a Pareto efficient amount of output of each good. But if the private costs 
and the social costs diverge, the market alone may not be sufficient to 
achieve Pareto efficiency. 

EXAMPLE: Pollution Vouchers 

Everyone wants a clean environment ... as long as someone else pays for 
it. Even if we reach a consensus on how much we should reduce pollution, 
there is still the problem of determining the most cost-effective way to 
achieve the targeted reduction. 

Take the case of nitrogen oxide emissions. One emitter may find it rela¬ 
tively inexpensive to reduce its emissions of this pollutant, whereas another 
may find it very expensive. Should they both be required to reduce their 
emission of pollutants by the same physical amount, by the same propor¬ 
tional amount, or by some other rule? 

Let’s look at a simple economic model. Suppose that there are only 
two firms. Firm l’s emission quota is X\ and firm 2’s is X2 ■ The cost of 
achieving an emission quota X\ is Ci(xi) and similarly for firm 2. The total 
amount of emission is fixed at some target level X. If we want to minimize 
the total costs of achieving the emissions target, subject to the aggregate 
constraint, we need to solve the following problem: 

min Ci(xi) + c 2 (x 2 ) 

Ei, X2 

such that xi + X 2 = X. 

A by now standard economic argument shows that the marginal cost of 
emission control must be equalized across the firms. If one firm had a 
higher marginal cost of emission control than the other, then we could 
lower total costs by reducing its quota and increasing the quota of the 
other firm. 

How can we achieve this outcome? If the government regulators had 
information on the cost of emissions for all firms, they could calculate the 
appropriate pattern of production and impose it on all the relevant parties. 
But the cost of gathering all this information, and keeping it up-to-date, is 
staggering. It is much easier to characterize the optimal solution than to 
actually implement it! 

Many economists have argued that the best way to implement the effi¬ 
cient solution to the emission control problem is to use a market. It appears 
that such a market based emissions control system will soon be put into 
effect in Southern California. Here is how the California plan works. 2 

2 See Richard Stevenson, “Trying a Market Approach to Smog,” New York Times , 

March 25, 1992, Cl. 
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Each of the 2700 largest polluters in Southern California is assigned a 
quota for their emissions of nitrogen oxide. This quota is initially set to be 
8 percent less than their previous year’s emission. If the firm exactly meets 
its emissions quota it faces no fines or penalties. However, if it reduces its 
emissions by more than its emissions quota, it can sell the extra “right to 
emit” on the open market. 

Suppose that a firm’s quota is 95 tons of nitrogen oxide emissions per 
year. If it manages to produce only 90 tons in a given year, then it can sell 
the right to emit 5 tons of nitrogen oxide to some other firm. Each firm can 
compare the market price of an emission credit to the cost of reducing its 
emissions and decide whether it was more cost-effective to reduce emissions 
further or purchase emission credits from other firms. 

Firms that find it easy to reduce emissions will sell credits to firms that 
find it costly to reduce emissions. In equilibrium, the market price of the 
right to emit one ton of pollution should just equal the marginal cost of 
reducing emissions by one ton. But this is exactly the condition character¬ 
izing the optimal pattern of emissions! The market for emission permits 
produces the efficient pattern of emissions automatically. 


34.4 Interpretation of the Conditions 

There are several useful interpretations of the conditions for Pareto effi¬ 
ciency derived above. Each of these interpretations suggests a scheme to 
correct the efficiency loss created by the production externality. 

The first interpretation is that the steel firm faces the wrong price for 
pollution. As far as the steel firm is concerned, its production of pollution 
costs it nothing. But that neglects the costs that the pollution imposes 
on the fishery. According to this view, the situation can be rectified by 
making sure that the polluter faces the correct social cost of its actions. 

One way to do this is to place a tax on the pollution generated by the 
steel firm. Suppose that we put a tax of t dollars per unit of pollution 
generated by the steel firm. Then the profit-maximization problem of the 
steel firm becomes 

max p s s — c s (s, x ) — tx. 

S,X 

The profit-maximization conditions for this problem will be 

Ac s (s,x) 

Ac s (s, x) 

- = 0 . 

Ax 

Comparing these conditions to equation (34.3), we see that setting 
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will make these conditions the same as the conditions characterizing the 
Pareto efficient level of pollution. 

This kind of a tax is known as a Pigouvian tax . 3 The problem with 
Pigouvian taxes is that we need to know the optimal level of pollution in 
order to impose the tax. But if we knew the optimal level of pollution we 
could just tell the steel firm to produce exactly that much and not have to 
mess with this taxation scheme at all. 

Another interpretation of the problem is that there is a missing market— 
the market for the pollutant. The externality problem arises because the 
polluter faces a zero price for an output good that it produces, even though 
people would be willing to pay money to have that output level reduced. 
From a social point of view, the output of pollution should have a negative 
price. 

We could imagine a world where the fishery had the right to clean water, 
but could sell the right to allow pollution. Let q be the price per unit of 
pollution, and let x be the amount of pollution that the steel mill produces. 
Then the steel mill’s profit-maximization problem is 

max p s s — qx — c s (s,x ), 

S,X 

and the fishery’s profit-maximization problem is 

max p f f + qx- c f (f,x). 

f,x 


The term qx enters with a negative sign in the profit expression for the 
steel firm since it represents a cost—the steel firm must buy the right to 
generate x units of pollution. But it enters with a positive sign in the 
expression for the profits of the fishery, since the fishery gets revenue from 
selling this right. 

The profit-maximization conditions are 


A c s (s,x) 

Ps = As 

(34.4) 

Ac s (s,x) 
q Ax 

(34.5) 

A Cf (f,x) 

Pf A / 

(34.6) 

Ac/(/, x) 
q Ax ' 

(34.7) 


3 Arthur Pigou (1877-1959), an economist at Cambridge University, suggested such 
taxes in his influential book The Economics of Welfare. 
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Thus each firm is facing the social marginal cost of each of its actions 
when it chooses how much pollution to buy or sell. If the price of pollution 
is adjusted until the demand for pollution equals the supply of pollution, 
we will have an efficient equilibrium, just as with any other good. 

Note that at the optimal solution, equations (34.5) and (34.7) imply that 

Ac s (s, a;) _ A c/(f,x) 

Ax Ax 

This says that the marginal cost to the steel firm of reducing pollution 
should equal the marginal benefit to the fishery of that pollution reduction. 
If this condition were not satisfied, we couldn’t have the optimal level of 
pollution. This is, of course, the same condition we encountered in equation 
(34.3). 

In analyzing this problem we have stated that the fishery had a right to 
clean water and that the steel mill had to purchase the right to pollute. 
But we could have assigned the property rights in the opposite way: the 
steel mill could have the right to pollute and the fishery would have to pay 
to induce the steel mill to pollute less. Just as in the case of the smoker 
and nonsmoker, this would also give an efficient outcome. In fact, it would 
give precisely the same outcome, since exactly the same equations would 
have to be satisfied. 

To see this, we now suppose that the steel mill has the right to pollute 
up to some amount x, say, but the fishery is willing to pay it to reduce its 
pollution. The profit-maximization problem for the steel mill is then 

max p s s + q(x — x) — c s (s, x). 

S,X 

Now the steel mill has two sources of income: it can sell steel, and it can 
sell pollution relief. The price equals marginal cost conditions become 


Ac s (s, x) 

P ■ A. =° 

(34.8) 

Ac s (s,x) _ 

Ax 

(34.9) 


The fishery’s maximization problem is now 

max p f f - q(x - x) -c f (f,x), 

f,x 

which has optimality conditions 


A c f{f,x) 

A/ 


A c f(f,x) 

Ax 


= 0 
= 0 . 


(34.10) 

(34.11) 
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Now observe: the four equations (34.8)-(34.11) are precisely the same as 
the four equations (34.4)-(34.7). In the case of production externalities, the 
optimal pattern of production is independent of the assignment of property 
rights. Of course, the distribution of profits will generally depend on the 
assignment of property rights. Even though the social outcome will be 
independent of the distribution of property rights, the owners of the firms in 
question may have strong views about what is an appropriate distribution. 


34.5 Market Signals 

Finally we turn to the third interpretation of externalities, which in some 
respects is the most profound. In the case of the steel mill and the fishery 
there is no problem if both firms merge—so why don’t they merge? In fact, 
when you think about it, there is a definite incentive for the two firms to 
merge: if the actions of one affect the other, then they can make higher 
profits together by coordinating their behavior than by each going alone. 
The objective of profit maximization itself should encourage the internal¬ 
ization of production externalities. 

Said another way: if the joint profits of the firms with coordination 
exceed the sum of the profits without coordination, then the current owners 
could each be bought out for an amount equal to the present value of the 
stream of profits for their firm, the two firms could be coordinated, and the 
buyer could retain the excess profits. The new buyer could be either of the 
old firms, or anybody else for that matter. 

The market itself provides a signal to internalize production externalities, 
which is one reason this kind of production externality is rarely observed. 
Most firms have already internalized the externalities between units that 
affect each other’s production. The case of the apple orchard and the 
beekeeper mentioned earlier is a case in point. Here there would be an 
externality if the two firms ignored their interaction ... but why would 
they be so foolish as to do so? It is more likely that one or both of the 
firms would realize that more profits could be made by coordinating their 
activities, either by mutual agreement or by the sale of one of the firms to 
the other. Indeed, it is very common for apple orchards to keep honey bees 
for the purpose of fertilizing the trees. That particular externality is easily 
internalized. 


EXAMPLE: Bees and Almonds 

Many varieties of fruit and nut trees need bees to pollinate their blossoms, 
thereby allowing the trees to produce crops. 

According to the Carl Hayden Bee Research Center in Tucson, Arizona, 
honeybees pollinate about one-third of the human diet and more than 50 


THE TRAGEDY OF THE COMMONS 659 


different agricultural crops valued at more than $20 billion a year in the 
United States. 4 

Some owners of orchards keep their own bees; some rely on their neigh¬ 
bors’ bees or wild bees. However, as the theory of externalities suggests, 
the most natural solution to the problem of inadequate bee supply is a 
market for bee services. 

Consider, for example, the California almond market. There are 530,000 
acres of almond trees in California, and every year, more than 1 million 
honeybee hives are needed to pollinate the trees. But California only has 
440,000 resident bee hives. There aren’t enough California bees to pollinate 
all those almond trees! 

The solution is to import bees from other nearby states. There is, in 
fact, a ready market for such services, with beekeepers bringing hives from 
North Dakota, Washington, and Colorado to supplement the native Cali¬ 
fornia bees. The almond growers pay well for these services: in 2004, bee 
pollination services sold for $54 per hive. 


34.6 The Tragedy of the Commons 


We have argued above that if property rights are well defined, there will be 
no problem with production externalities. But if property rights are not 
well defined, the outcome of the economic interactions will undoubtedly 
involve inefficiencies. 

In this section we will examine a particularly well-known inefficiency 
called “the tragedy of the commons.” 5 We will pose this problem in the 
original context of a common grazing land, although there are many other 
possible illustrations. 

Consider an agricultural village in which the villagers graze their cows 
on a common held. We want to compare two allocation mechanisms: the 
first is the private ownership solution where someone owns the held and 
decides how many cows should graze there; the second is the solution where 
the held is owned in common by the villagers and access to it is free and 
unrestricted. 

Suppose that it costs a dollars to buy a cow. How much milk the cow 
produces will depend on how many other cows are grazed on the common 
land. We’ll let /(c) be the value of the milk produced if there are c cows 
grazed on the common. Thus the value of the milk per cow is just the 
average product, /(c)/c. 


4 Anna Oberthur, “Almond Growers Face Need for Bees,” Associated Press, February 
29, 2004. 

5 See G. Hardin, “The Tragedy of the Commons,” Science, 1968, 1243—47. 
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How many cows would be grazed on the common if we wanted to maxi¬ 
mize the total wealth of the village? In order to maximize the total amount 
of wealth, we set up the following problem: 

max /(c) — ac. 

C 

It should be clear by now that the maximal production will occur when 
the marginal product of a cow equals its cost, a: 

MP(c*) = a. 

If the marginal product of a cow were greater than a , it would pay to put 
another cow on the commons; and if it were less than a, it would pay to 
take one off. 

If the common grazing ground were owned by someone who could restrict 
access to it, this is indeed the solution that would result. For in this case, 
the owner of the grazing grounds would purchase just the right amount of 
cows to maximize his profits. 

Now what would happen if the individual villagers decided whether or 
not to use the common field? Each villager has a choice of grazing a cow 
or not grazing one, and it will be profitable to graze a cow as long as the 
output generated by the cow is greater than the cost of a cow. Suppose that 
there are c cows currently being grazed, so that the current output per cow 
is /(c)/c. When a villager contemplates adding a cow, the total output will 
be /(c+ 1), and the total number of cows will be c+ 1. Thus the revenue 
that the cow generates for the villager will be /(c + l)/(c + 1). He must 
compare this revenue to the cost of the cow, a. If /(c + l)/(c + 1) > a, it 
is profitable to add the cow since the value of the output exceeds the cost. 
Hence the villagers will choose to graze cows until the average product of 
a cow is driven to a. It follows that the total number of cows grazed will 
be c, where 



c 


Another way to derive this result is to appeal to free entry. If it is 
profitable to graze a cow on the common field, villagers will purchase cows. 
They will stop adding cows to the common only when the profits have been 
driven to zero, that is, when 


/(c) -ac= 0, 


which is just a rearrangement of the condition in the last paragraph. 

When an individual decides whether or not to purchase a cow, he looks 
at the extra value he will get /(c)/c and compares this to the cost of the 
cow, a. This is fine for him, but what has been left out of this calculation is 
the fact that his extra cow will reduce the output of milk from all the other 
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cows. Since he is ignoring this social cost of his purchase, too many cows 
will be grazed on the common ground. (We assume that each individual 
has a number of cows that is negligible relative to the total number grazed 
on the common.) 

This argument is illustrated in Figure 34.4. Here we have depicted a 
falling average product curve, since it is reasonable to suppose that the 
output per cow declines as more and more cows are grazed on the common 
land. 


AP 

MP 


a = cost of cow 



Efficient Equilibrium 

output output 


NUMBER OF COWS 


The tragedy of the commons. If the grazing area is pri¬ 
vately owned, the number of cows will be chosen so that the 
marginal product of a cow equals its cost. But if grazing area 
is common property, cows will be grazed until the profits are 
driven to zero; thus the area will be overgrazed. 


Figure 

34.4 


Since the average product is falling, it must be that the marginal product 
curve always lies below the average product curve. Thus the number of 
cows where the marginal product equals a must be less than where the 
average product equals a. The field will be overgrazed in the absence of a 
mechanism to restrict use. 

Private property provides such a mechanism. Indeed, we have seen that 
if everything that people care about is owned by someone who can control 
its use and, in particular, can exclude others from overusing it, then there 
are by definition no externalities. The market solution leads to a Pareto 
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efficient outcome. Inefficiencies can only result from situations where there 
is no way to exclude others from using something, a topic that we will 
investigate in the next chapter. 

Of course, private property is not the only social institution that can 
encourage efficient use of resources. For example, rules could be formulated 
about how many cows can be grazed on the village common. If there is a 
legal system to enforce those rules, this may be a cost-effective solution to 
providing an efficient use of the common resource. However, in situations 
where the law is ambiguous or nonexistent, the tragedy of the commons 
can easily arise. Overfishing in international waters and the extermination 
of several species of animals due to overhunting are sobering examples of 
this phenomenon. 


EXAMPLE: Overfishing 

According to a report in the New York Times, “... overfishing has deci¬ 
mated the stocks of cod, haddock and flounder that have sustained New 
Englanders for centuries.” 6 According to one expert, fishermen in New 
England are taking 50 to 70 percent of the available stock, over twice the 
sustainable amount. 

This overfishing is a prime example of the problem of the commons: 
each fisherman has a negligible impact on the total stock of fish, but the 
accumulated efforts of thousands of fishermen results in serious depletion. 
The New England Fisheries Management Council is attempting to alleviate 
the problem by banning new entry to the industry, requiring fishermen to 
limit their days at sea, and increasing the mesh size of their nets. 

It appears that the supplies of fish could be restocked in as little as 5 
years if conservation measures were undertaken. The present value of prof¬ 
its to the industry as a whole would be larger with regulation to prevent 
overfishing. However, such measures would almost certainly imply a sub¬ 
stantial reduction in the number of fishing boats in the industry, which is 
highly unpopular with the small fishermen, who would likely be forced to 
leave the industry. 


EXAMPLE: New England Lobsters 

Some fishing industries have already applied stringent rules to avoid over¬ 
fishing. For example, lobster fishermen work under carefully designed rules 
to ensure that they do not fish themselves out of a livelihood. For example, 
they are required to toss back any female lobster bearing eggs, any lobster 


‘Plenty of Fish in the Sea? Not Anymore,” New York Times, March 25, 1992, A15. 
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shorter than a minimum size, and any lobsters bigger than a maximum 
size. 

The “eggers” give birth to more lobsters and the small “tiddlers” grow 
up to mate. But why throw back the big lobsters? According to marine 
biologists, large lobsters produce more offspring and larger offspring. If 
fishermen always took the largest lobsters, the remaining small lobsters 
would pass their genes onto their progeny, resulting in smaller and smaller 
lobsters in each generation. 

With lobsters there is good news and bad news. First the good news. 
The 2003 Maine lobster harvest was 5.4 million pounds, more than 2.5 times 
the 1945-85 average. This suggests that the careful husbandry practiced 
by the industry has yielded a significant growth in the lobster population. 

However, it appears that conservation isn’t the only factor. There have 
also been considerable changes in the population of other species of marine 
life off the Maine coast, such as sea urchins, and some observers believe that 
these changes are the primary driver of change in the lobster population. 7 

This leads to the bad news. Further south, in Massachusetts and New 
York, the lobster catch has fallen dramatically. No one is quite sure why 
one region is doing so well and the other so poorly. Ironically, Maine may 
be doing well due to increased harvesting of finned fish and of sea urchins, 
both of which eat young lobsters. Massachusetts’ problems may be due 
to specific factors, such as a large oil spill and a disfiguring shell disease. 
Another culprit is warming water: Narragansett Bay temperatures have 
risen almost two degrees Celsius in the last 20 years. 

Ecologies can be very complex and can change rapidly. The efforts to 
avoid overfishing are to be applauded, but they are only part of the story. 


34.7 Automobile Pollution 

As suggested above, pollution is a prime example of an economic external¬ 
ity. The activity of one consumer operating an automobile will typically 
lower the quality of the air that other consumers breathe. It seems unlikely 
that an unregulated free market would generate the optimal amount of pol¬ 
lution; more likely, if the consumer bears no cost in generating pollution, 
too much pollution would be produced. 

One approach to controlling the amount of automobile pollution is to re¬ 
quire that automobiles meet certain standards in the amount of pollution 
that they generate. This has been the basic thrust of U.S. antipollution 
policy since the Clean Air Act of 1963. That act, or, more properly, the 
subsequent amendments, set automobile emission standards for the manu¬ 
facturers of vehicles in the United States. 

7 See The Economist , “Claws!” August 19, 2004, and Cornelia Dean, “Lobster Boom 
and Bust,” New York Times, August 9, 2004. 
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Lawrence White has examined the benefits and costs of this program; 
most of the following discussion is drawn from this work. 8 

White estimates that the cost of emission control equipment is about 
$600 per car, the extra maintenance costs are about $180 per car, and 
the costs of the reduced gasoline mileage and the necessity for unleaded 
gasoline come to about $670 per car. Thus the total cost per car of the 
emission control standards is about $1450 over the lifetime of the car. (All 
figures are in 1981 dollars.) 

He argues that there are several problems with the current approach to 
the regulation of automobile emissions. First, it requires that all automo¬ 
biles meet the same standards. (California is the only state with different 
standards for emission control.) This means that everyone who buys a car 
must pay an extra $1450 whether they live in a high pollution area or not. 
A 1974 National Academy of Sciences study concluded that 63 percent of 
all U.S. cars did not require the stringent standards now in effect. Accord¬ 
ing to White, “almost two-thirds of car buyers are spending ... substantial 
sums for unnecessary systems.” 

Secondly, most of the responsibility for meeting the standards falls on 
the manufacturer, and little falls on the user. Owners of cars have little 
incentive to keep their pollution control equipment in working order unless 
they live in a state with required inspections. 

More significantly, motorists have no incentive to economize on their 
driving. In cities such as Los Angeles, where pollution is a significant 
hazard, it makes good economic sense to encourage people to drive less. 
Under the current system, people who drive 2000 miles a year in North 
Dakota pay exactly the same amount for pollution control as people who 
drive 50,000 miles a year in Los Angeles. 

An alternative solution to pollution would be effluent fees. As described 
by White, effluent fees would require an annual inspection of all vehicles 
along with an odometer reading and tests that would estimate the likely 
emissions of the vehicle during the past year. Different communities could 
then levy fees based on the estimated amount of pollution that had actu¬ 
ally been generated by the operation of the vehicle. This method would 
ensure that people would face the true cost of generating pollution and 
would encourage them to choose to generate the socially optimal amount 
of pollution. 

Such a system of effluent fees would encourage the vehicle owners them¬ 
selves to find low-cost ways of reducing their emissions—investing in pol¬ 
lution control equipment, changing their driving habits, and changing the 
kinds of vehicles that they operate. A system of effluent fees could im¬ 
pose even higher standards than are now in effect in communities where 
pollution is a serious problem. Any desired level of pollution control can 


See Lawrence White, The Regulation of Air Pollutant Emissions from Motor Vehicles 
(Washington, D.C.: American Enterprise Institute for Public Policy Research, 1982). 
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be achieved by appropriate effluent fees ... and it can be achieved at a 
substantially lower cost than the current system of mandated standards. 

Of course, there is no reason why there might not also be some federally 
mandated standards for the two-thirds of the vehicles that are operated in 
localities where pollution is not a serious problem. If it is cheaper to impose 
standards than to require inspections, then by all means that should be the 
proper choice. The appropriate method of pollution control for automobiles 
should depend on a rational analysis of benefits and costs—as should all 
social policies of this nature. 


Summary 

1. The First Theorem of Welfare Economics shows that a free, competitive 
market will provide an efficient outcome in the absence of externalities. 

2. However, if externalities are present, the outcome of a competitive mar¬ 
ket is unlikely to be Pareto efficient. 

3. However, in this case, the state can sometimes “mimic” the role of the 
market by using prices to provide correct signals about the social cost of 
individual actions. 

4. More importantly, the legal system can ensure that property rights are 
well defined, so that efficiency-enhancing trades can be made. 

5. If preferences are quasilinear, the efficient amount of a consumption 
externality will be independent of the assignment of property rights. 

6. Cures for production externalities include the use of Pigouvian taxes, 
setting up a market for the externality, simply allowing firms to merge, or 
transferring property rights in other ways. 

7. The tragedy of the commons refers to the tendency for common property 
to be overused. This is a particularly prevalent form of externality. 


REVIEW QUESTIONS 


1. True or false? An explicit delineation of property rights usually elimi¬ 
nates the problem of externalities. 

2. True or false? The distributional consequences of the delineation of 
property rights are eliminated when preferences are quasilinear. 
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3. List some other examples of positive and negative consumption and 
production externalities. 

4. Suppose that the government wants to control the use of the commons, 
what methods exist for achieving the efficient level of use? 


CHAPTER 



INFORMATION 

TECHNOLOGY 


One of the most radical changes in the economy in the last 15 years has 
been the emergence of the information economy. The popular press is 
filled with stories about advances in computer technology, the Internet, and 
new software. Not surprisingly, many of these stories are on the business 
pages of the newspaper, for this technological revolution is also an economic 
revolution. 

Some observers have gone so far as to put the Information Revolution 
on a par with the Industrial Revolution. Just as the Industrial Revolution 
transformed the way goods were produced, distributed, and consumed, the 
Information Revolution is transforming the way information is produced, 
distributed, and consumed. 

It has been claimed that these dramatically new technologies will require 
a fundamentally different form of economics. Bits, it is argued, are fun¬ 
damentally different than atoms. Bits can be reproduced costlessly and 
distributed around the world at the speed of light, and they never deterio¬ 
rate. Material goods, made of atoms, have none of these properties: they 
are costly to produce and transport, and they inevitably deteriorate. 

It is true that the unusual properties of bits require new economic anal¬ 
ysis, but I would argue that they do not require a new kind of economic 
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analysis. After all, economics is primarily about people not goods. The 
models we have analyzed in this book have had to do with how people 
make choices and interact with each other. We have rarely had occasion 
to refer to the specific goods that were involved in the transactions. The 
fundamental concerns were the tastes of the individuals, the technology of 
production, and the structure of the market, and these same factors will 
determine how markets for information will work ... or not work. 

In this chapter we will investigate a few economic models relevant to the 
information revolution. The first has to do with the economics of networks, 
the second with switching costs, and the third with rights management for 
information goods. These examples will illustrate how the fundamental 
tools of economic analysis can help us to understand the world of bits as 
well as the world of atoms. 


35.1 Systems Competition 

Information technology is generally used in systems. Such systems involve 
several components, often provided by different firms, that only have value 
if they work together. Hardware is useless without software, a DVD player 
is useless without DVD disks, an operating system is worthless without 
applications, and a web browser is useless without web servers. All of these 
are examples of complements: goods where the value of one component 
is significantly enhanced by the presence of another component. 

In our discussion of consumer theory, we described left shoes and right 
shoes as complements. The cases above are equally extreme: the best 
computer hardware in the world can’t function unless there is software 
written for it. But unlike shoes, the more software that is available for it, 
the more valuable it becomes. 

Competition among the providers of these components often have to 
worry just as much about their “complementers” as their competitors. 
A key part of Apple’s competitive strategy has to involve their relations 
with software developers. This gives competitive strategy in information 
technology (IT) industries a different flavor than strategy in traditional 
industries. 1 


35.2 The Problem of Complements 

To illustrate these points, let us consider the case of a Central Process¬ 
ing Unit (CPU) and an Operating System (OS). A CPU is an integrated 


1 See Shapiro, Carl and Hal R. Varian, Information Rules: A Strategic Guide to the 
Network Economy , Harvard Business School Press, 1998, for a guide to competitive 
strategy in IT industries. 
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circuit that is the “brain” of a computer. Two familiar manufacturers of 
CPUs are Intel and Motorola. An OS is the software that allows users 
and applications to access the functions of the CPU. Apple and Microsoft 
both make operating systems. Normally, a special version of an operating 
system has to be created for each CPU. 

From the viewpoint of the end user, the CPU can only be used if there 
is a compatible operating system. The CPU and the OS are complements, 
just as left shoes and right shoes are complements. 

The most popular CPUs and OSs in the world today are made by Intel 
and Microsoft, respectively. These are, of course, two separate companies 
that set the prices of their products independently. The PowerPC, another 
popular CPU, was designed by a consortium consisting of IBM, Motorola, 
and Apple. Two commercial operating systems for the PowerPC are the 
Apple OS and IBM’s AIX. In addition to these commercial operating sys¬ 
tems, there are free systems like BSD and GNU-Linux that are provided 
by groups of programmers working on a volunteer basis. 

Let us consider the pricing problem facing sellers of complementary prod¬ 
ucts. The critical feature is that the demand for either product depends 
on the price of both products. If p± is the price of the CPU and p 2 is the 
price of the OS, the cost to the end user depends on p\ + p 2 . Of course, 
you need more than just a CPU and an OS to make a useful system, but 
that just adds more prices to the sum; we’ll keep things simple by sticking 
with two components. 

The demand for CPUs depends on the price of the total system, so we 
write D(pi+p 2 ). If we let ci be the marginal cost of a CPU and F the fixed 
cost, the profit-maximization problem of the CPU maker can be written 

max (pi - Ci)D(pi + p 2 ) - F x . 

Pi 

Similarly, the profit-maximization problem of the OS maker can be written 
max (p 2 - Ci)D(pi + p 2 ) - F 2 . 

P 2 

In order to analyze this problem, let us assume that the demand function 
has the linear form 

D(p) = a — bp. 

Let us also assume, for simplicity, that the marginal costs are so small that 
they can be ignored. Then the CPU profit-maximization problem becomes 

max pi [a - b{pi + p 2 )\ - Fj, 

Pi 


max api — bp\ — bp±p 2 — F\. 

pi 


or 
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It turns out that the marginal revenue from a price increase Api is 

(a — 2bpi — bp 2 ) Api. 

If profit is maximized, then the change in revenue from an increase in pi 
must be zero: 

a — 2bpi — bp 2 = 0. 

Solving this equation we have 


Pi 


a — bp 2 
2b 


In exactly the same way, we can solve for the profit-maximizing choice of 
the OS price: 

a — bpi 

Note that the optimal choice of each firm’s price depends on what it expects 
the other firm to charge for its component. As usual, we are interested in 
a Nash equilibrium, where each firm’s expectations about the other’s 
behavior are satisfied. 

Solving the system of two equations in two unknowns, we have 


Pl=P2 = Tb- 


This gives us the profit-maximizing prices if each firm unilaterally and 
independently sets the price of its component of the system. The price of 
the total system is 

2a 

Pl+Pj= 3t' 

Now let us consider the following experiment. Suppose that the two 
firms merge to form an integrated firm. Instead of setting the prices of the 
components, the integrated firm sets the price of the final system, which 
we denote by p. Its profit-maximization problem is therefore 


max p(a — bp). 

p 

The marginal revenue from increasing the system price by A p is 


(a — 2bp) Ap. 


Setting this equal to zero and solving, we find that the price that the 
integrated firm will set for the final system is 
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Note the following interesting fact: the profit-maximizing price set by 
the integrated firm is less than the profit-maximizing price set by the two 
independent firms. Since the price of the system is lower, consumers will 
buy more of them and be better off. Furthermore, the profits of the inte¬ 
grated firm are larger than the sum of the equilibrium profits of the two 
independent firms. Everyone has been made better off by coordinating the 
pricing decision! 

This turns out to be true in general: a merger of two monopolies that 
produce complementary products results in lower prices and higher profits 
than if the two firms set their prices independently. 

The intuition is not hard to see. When firm 1 contemplates a price 
decrease for the CPU, it will increase demand for CPUs and OSs. But it 
only takes into account the impact on its own profit from cutting price, 
ignoring the profits that will accrue to the other firm. This leads it to cut 
prices less than it would if it were interested in maximizing joint profit. 
The same analysis applies to firm 2, leading to prices that are “too high” 
from the viewpoint of both profit-maximization and consumer surplus. 


Relationships among Complementors 

The “merger of complementors” analysis is provocative, but we shouldn’t 
immediately leap to the conclusion that mergers of OS and CPU manu¬ 
facturers are a good idea. What the result says is that independent price 
setting will lead to prices that are too high from the viewpoint of joint 
profitability, but there are lots of intermediate cases between totally inde¬ 
pendent and fully integrated. 

For example, one of the firms can negotiate prices for components and 
then sell an integrated bundle. This is, more or less, what Apple does. They 
buy PowerPC CPUs in bulk from Motorola, build them into computers, and 
then bundle the operating system and computers together for sale to the 
end customers. 

Another model for dealing with the systems pricing problem is to use 
revenue sharing. Boeing builds airplane bodies and GE builds airplane 
engines. The end user generally wants both a body and an engine. If 
GE and Boeing each set their prices independently, they could decide to 
set their prices too high. So what they do instead is to negotiate a deal 
in which GE will receive a fraction of the revenue from the sale of the 
assembled aircraft. Then GE is happy to have Boeing negotiate to get as 
high a price as possible for the package, confident that it will receive its 
specified share. 


2 


This rather remarkable fact was discovered by Augustin Cournot, whom we previously 
met in Chapter 27. 
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There are other mechanisms that work in different industries. Consider, 
for example, the DVD industry mentioned in the introduction. This has 
been a very successful new product, but making it work was tricky. Con¬ 
sumer electronics firms didn’t want to produce players unless they were as¬ 
sured that there would be plenty of content available, and content providers 
didn’t want to produce content unless they were sure that would be lots of 
DVD players out there. 

On top of this, both the consumer electronics firms and the content 
producers would have to worry about the pricing of complements problem: 
if there were only a few providers of players and only a few providers of 
content, then they would each want to price their products “too high,” 
reducing the total profit available in the industry and making consumers 
worse off. 

Sony and Philips, who held the basic patents on the DVD technology, 
helped solve this problem by licensing the technology widely at attractive 
prices. They also realized that there had to be a lot of competition to keep 
the prices down and kick start the industry. They recognized that it was 
much better to have a small share of a large, successful industry than to 
have a large share of a nonexistent industry. 

Yet another model for relationships among complementers might be 
called “commoditize the complement.” Look back at firm l’s profit maxi¬ 
mization problem: 

maxpiD(pi + p 2 ) — Fi. 

pi 

At any given configuration of prices, reducing p\ may or may not increase 
firm l’s revenues, depending on the demand elasticity. But lowering p 2 will 
always increase firm l’s revenue. The challenge facing firm 1 is then: how 
can I get firm 2 to cut its price? 

One way is to try to make competition for firm 2 more intense. Vari¬ 
ous strategies are possible here, depending on the nature of the industry. 
In technology-intensive industries, standardization becomes an important 
tool. An OS producer, for example, would want to encourage standardized 
hardware. This not only makes its job easier, but it also ensures that the 
hardware industry will be highly competitive. This will ensure that com¬ 
petitive forces push down the price of hardware and reduce the total system 
price to end users, thereby increasing the demand for operating systems. 3 

EXAMPLE: Apple's iPod and iTunes 

Apple’s iPod music player is hugely popular. As of January 2009, Apple 
had sold 6 billion songs, accounting for an estimated 70% of online music 
sales and an 88% market share in the United States. 

3 See Brandenburger, Adam and Barry Nalebuff, Co-opetition, Doubleday, 1997 for 

further analysis of strategy for complementers. 
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There is an obvious complementary relationship between the music player 
and the music. The classic business model for complements comes from 
Gillette: “Give away the razor and sell the blades.” But in this case the 
model is reversed: most of Apple’s profit comes from selling the iPod, with 
only a small fraction coming from selling the music. 

This is primarily due to the fact that Apple does not own the music, 
so the revenue from music sold on iTunes must be shared between the 
producers of the music and Apple. Since Apple makes most of its money 
from the player, it wants to have cheap music. Since the studios make most 
of their money from the songs, they want to have expensive music. This 
has led to some conflicts between Apple and the music studios. 

Originally, all songs on iTunes sold for 99 cents. Some music publishers 
felt that prices should be higher for new releases. After much back and 
forth, Apple announced a new policy in March of 2009, where some new 
releases would sell for $1.29. This is a form of differential pricing, or “ver¬ 
sioning,” which is common in media markets. Those who are enthused and 
impatient pay the higher price, while those who are more patient can wait 
for the price reduction. 


EXAMPLE: Who Makes an iPod? 

Hint: it’s not Apple. In fact, iPods are assembled in a number of Asian 
countries, by a variety of assemblers, including Asustek, Inventec Appli¬ 
ances, and Foxconn. 

But that’s not the end of the story. These companies merely assemble the 
parts that are purchased from other companies. Recently, some economists 
tried to track down the origin of the 451 parts that go into an iPod. 4 

The retail value of the 30-gigabyte video iPod that the authors examined 
was $299. The most expensive component in it was the hard drive, which 
was manufactured by Toshiba and costs about $73. The next most costly 
components were the display module (about $20), the video/multimedia 
processor chip ($8), and the controller chip ($5). They estimated that the 
final assembly, done in China, cost only about $4 a unit. 

The authors of the report tried to track down where the major parts 
were manufactured and how much value was added at each stage of the 
production process. The researchers estimated that $163 of the iPod’s $299 
retail value in the United States was captured by American companies and 
workers, breaking it down to $75 for distribution and retail costs, $80 to 
Apple, and $8 to various domestic component makers. Japan contributed 


4 Greg Linden, Kenneth L. Kraemer, and Jason Dedrick, “Who Captures Value in 
a Global Innovation Network,” Communications of the ACM , 52 (3), March 2009, 
140-144. 
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about $26 to the value added (mostly via the Toshiba disk drive), while 
Korea contributed less than $1. 

Ideally, each component was purchased from the lowest-cost provider, 
and to a large extent these decisions reflected the comparative advantage 
of the different providers. 

Even though the assembly in China only contributed about 1% of the 
value of the iPod, the iPod contributed about $150 of the bilateral trade 
deficit between China and the United States. What this shows is that the 
bilateral trade deficit makes no sense. Most of the high-value parts in the 
iPod were in fact imported into China from other countries in the first place. 
The highest-value component of the iPod -the design and engineering that 
went into it—came from the United States. 

EXAMPLE: AdWords and AdSense 

Two of Google’s advertising programs are AdWords, which shows ads tar¬ 
geted to search queries, and AdSense, which shows ads based on the con¬ 
tents of a web page. AdWords shows “search targeted ads” and AdSense 
shows “contextually targeted ads.” 

When a user clicks on a contextually targeted ad on a particular site, the 
advertiser pays a price per click determined by an auction, similar to that 
described in Chapter 17. The revenue from this ad click is divided between 
the publisher and Google according to a revenue-sharing formula. Hence 
the AdSense program provides a simple way for a publisher to generate 
advertising revenue without having to manage an advertising program on 
its own. 

There is a strong complementarity between the AdWords and AdSense 
programs. By providing a way for publishers to make money from their 
content, AdSense encourages the production of content. This means that 
there is more useful information available on the web and therefore content 
for Google to index and search. By creating a business model for content 
creation, Google makes its search service more valuable. 


35.3 Lock-In 

Since IT components often work together as systems, switching any one 
component often involves switching others as well. This means that the 
switching costs associated with one component in IT industries may be 
quite substantial. For example, switching from a Macintosh to a Windows- 
based PC involves not only the hardware costs of the computer itself, but 
also involves purchasing of a whole new library of software, and, even more 
importantly, learning how to use a brand new system. 

When switching costs are very high, users may find themselves experi¬ 
encing lock-in, a situation where the cost of changing to a different system 
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is so high that switching is virtually inconceivable. This is bad for the con¬ 
sumers, but is, of course, quite attractive for the seller of the components 
that make up the system in question. Since the locked-in user has a very 
inelastic demand, the seller(s) can jack up the prices of their components 
to extract consumer surplus from the user. 

Of course, wary consumers will try to avoid such lock-in, or, at the very 
least, bargain hard to be compensated for being locked in. Even if the 
consumers themselves are poor at bargaining, competition among sellers of 
systems will force prices down for the initial purchase, since the locked-in 
consumers can provide them with a steady revenue stream afterwords. 

Consider, for example, choosing an Internet service provider (ISP). Once 
you have committed to such a choice, it may be inconvenient to switch 
due to the cost of notifying all of your correspondents about your new e- 
mail address, reconfiguring your Internet access programs, and so on. The 
monopoly power due to these switching costs means that the ISP can charge 
more than the marginal cost of providing service, once it has acquired you 
as a customer. But the flip side of this effect is that the stream of profits of 
the locked-in customers is a valuable asset, and ISPs will compete up front 
to acquire such customers by offering discounts and other inducements to 
sign up with them. 

A Model of Competition with Switching Costs 

Let’s examine a model of this phenomenon. We assume that the cost of 
providing a customer with Internet access is c per month. We also assume 
a perfectly competitive market, with many identical firms, so that in the 
absence of any switching costs, the price of Internet service would simply 
be p = c. 

But now suppose that there is a cost s of switching ISPs and that ISPs 
can offer a discount of size d for the first month to attract new customers. 
At the start of a given month, a consumer contemplates switching to a new 
ISP. If he does so, he only has to pay the discounted price, p — d, but he 
also has to endure the switching costs s. If he stays with his old provider, 
he has to pay the price p forever. After the first month, we assume that 
both providers continue to charge the same price p forever. 

The consumer will switch if the present value of the payments to the 
new provider plus the switching cost is less than the present value of the 
payments to the original ISP. Letting r be the (monthly) interest rate, the 
consumer will switch if 

(p- d) + - + s < p+~. 

r r 

Competition between providers ensures that the consumer is indifferent 
between switching or not switching, which implies 

(p — d) + s = p. 
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It follows that d = s, which means the discount offered just covers the 
switching cost of the consumer. 

On the producer side, we suppose that competition forces the present 
value of profits to be zero. The present value of profit associated with a 
single customer is initial discount, plus the present value of the profits in 
future months. Letting r be the (monthly) interest rate, and using the fact 
that d = s, the zero-profit condition can be written as 

(p— s) — c +-—- = 0. (35.1) 

r 

Rearranging this equation gives us two equivalent ways to describe the 
equilibrium price: 

p — c+--= s, (35.2) 

r 

or 

p = c-\ -s. (35.3) 

1 + r 

Equation (35.2) says that the present value of the future profits from the 
consumer must just equal the consumer’s switching cost. Equation (35.3) 
says that the price of service is a markup on marginal cost, with the amount 
of the markup is proportional to the switching costs. 

Adding switching costs to the model raises the monthly price of service 
above cost, but competition for this profit flow forces the initial price down. 
Effectively, the producer is investing in the discount d = s in order to 
acquire the flow of markups in the future. 

In reality many ISPs have other sources of revenue than just the monthly 
income from their customers. America Online, for example, derives a sub¬ 
stantial part of its operating revenue from advertising. It makes sense for 
them to offer large up-front discounts, in order to capture advertising rev¬ 
enue, even if they have to provide Internet connections at rates at or below 
cost. 

We can easily add this effect to the model. If a is the advertising revenue 
generated by the consumer each month, the zero-profit condition requires 

(p— s) + a — c+ P Q —- = 0. (35.4) 

r 


Solving for p we have 


p = c — a 


r 


1 + r 


s. 


This equation shows that what is relevant is the net cost of servicing the 
customer, c — o, which involves both the service cost and the advertising 


revenues. 
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EXAMPLE: Online Bill Payment 

Many banks offer low-cost or even free bill payment services. Some banks 
will even pay customers who start using their online bill payment services. 

Why the big rush to pay bills online? The answer is that banks have 
found that once a customer goes to the trouble of setting up the bill-paying 
service, he or she is much less likely to switch banks. According to a Bank 
of America study, the frequency of switching goes down by 80 percent for 
such customers. 5 

It’s true that once you get online bill payment up and running, it’s hard 
to give it up. Switching to another bank to get an extra tenth of a percent 
of interest on your checking account doesn’t seem very attractive. As in 
the analysis of lock-in presented above, investing in services that create 
switching costs can be very profitable for businesses. 


EXAMPLE: Number Portability on Cell Phones 

At one time, cell phone providers prevented individuals from transferring 
their phone numbers when they switched carriers. This prohibition in¬ 
creases individual switching costs significantly, since anyone who switched 
would have to notify all of his or her friends about the new number. 

As the model presented in this chapter describes, the fact that customers 
could be charged more when they faced high switching costs meant that 
the phone providers would compete even more aggressively to sign up such 
highly profitable customers. This competition took the form of providing 
low-cost or even free phones, along with offers of “free minutes,” “rollover 
plans,” “cell-to-cell discounts,” and other marketing gimmicks. 

The cell phone industry was united in its efforts to block number porta¬ 
bility and lobbied regulatory agencies and Congress to maintain the status 
quo. 

Slowly but surely, the tide started to turn against the cell phone industry 
as consumers demanded number portability. The Federal Communications 
Commission, which regulates the telephone business, started dropping hints 
that cell phone providers should consider ways in which they could imple¬ 
ment number portability. 

In June 2003, Verizon Wireless said it would drop opposition to number 
portability. Their decision appeared to rest on two considerations. First, 
it was becoming clear that they were fighting a losing battle: eventually 
cell-number portability would win out. Perhaps more significantly, several 
recent consumer surveys showed that Verizon led the industry in terms of 

5 Michelle Higgins, “Banks Use Online Bill Payment In Effort to Lock In Customers,” 

Wall Street Journal, September 4, 2002. 
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customer satisfaction. It appeared quite possible that Verizon would gain 
more customers than it lost if switching costs were reduced. Indeed, it 
appears that ultimately Verizon benefited from number portability. 

This episode provides a good lesson in business strategy: tactics to in¬ 
crease customer switching costs may be valuable for a while. But ultimately 
service quality plays a decisive role in attracting and retaining customers. 


35.4 Network Externalities 

We have already examined the idea of externalities in Chapter 34. Recall 
that economists use this term to describe situations in which one person’s 
consumption directly influences another person’s utility. Network exter¬ 
nalities are a special kind of externalities in which one person’s utility for 
a good depends on the number of other people who consume this good. 6 

Take for example a consumer’s demand for a fax machine. People want 
fax machines so they can communicate with each other. If no one else has 
a fax machine, it certainly isn’t worthwhile for you to buy one. Modems 
have a similar property: a modem is only useful if there is another modem 
somewhere that you can communicate with. 

Another more indirect effect for network externalities arises with com¬ 
plementary goods. There is no reason for a video store to locate in a 
community where no one owns a video player; but then again, there is 
little reason to buy a video player unless you have access to pre-recorded 
video tapes to play in the machine. In this case the demand for video tapes 
depends on the number of VCRs, and the demand for VCRs depends on 
the number of video tapes available, resulting in a slightly more general 
form of network externalities. 


35.5 Markets with Network Externalities 

Let us try to model network externalities using a simple demand and supply 
model. Suppose that there are 1000 people in a market for some good and 
we index the people by v = 1,..., 1000. Think of v as measuring the 
reservation price for the good by person v. Then if the price of the good 
is p , the number of people who think that the good is worth at least p is 
1000 — p. For example, if the price of the good is $200, then there are 
800 people who are willing to pay at least $200 for the good, so the total 
number of units sold would be 800. This structure generates a standard, 
downward-sloping demand curve. 


6 More generally, a person’s utility could depend on the identity of other users; it is 
easy to add this to the analysis. 
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But now let’s add a twist to the model. Suppose that the good we are 
examining exhibits network externalities, like a fax machine or a telephone. 
For simplicity, let us suppose that the value of the good to person v is vn, 
where n is the number of people who consume the good—the number of 
people who are connected to the network. The more people there are who 
consume the good, the more each person is willing to pay to acquire it. 7 
What does the demand function look like for this model? 

If the price is p , there is someone who is just indifferent between buying 
the good and not buying it. Let v denote the index of this marginal indi¬ 
vidual. By definition, he is just indifferent to purchasing the good, so his 
willingness to pay for the good equals its price: 

p = vn. (35.5) 

Since this “marginal person” is indifferent, everyone with a higher value 
of v than v must definitely want to buy. This means that the number of 
people who want to buy the good is 

n = 1000 - v. (35.6) 

Putting equations (35.5) and (35.6) together, we have a condition that 
characterizes equilibrium in this market: 

p = n(1000 — n). 

This equation gives us a relationship between the price of the good and the 
number of users. In this sense, it is a kind of demand curve; if there are n 
people who purchase the good, then the willingness to pay of the marginal 
individual is given by the height of the curve. 

However, if we look at the plot of this curve in Figure 35.1, we see 
that it has quite a different shape than a standard demand curve! If the 
number of people who connect is low, then the willingness to pay of the 
marginal individual is low, because there aren’t many other people out 
there that he can communicate with. If there are a large number of people 
connected, then the willingness to pay of the marginal individual is low, 
because everyone else who valued it more highly has already connected. 
These two forces lead to the humped shape depicted in Figure 35.1. 

Now that we understand the demand side of the market, let’s look at 
the supply side. To keep things simple, let us suppose that the good can 
be provided by a constant returns to scale technology. As we’ve seen, this 
means that the supply curve is a flat line at price equals average cost. 

Note that there are three possible intersections of the demand and supply 
curve. There is a low-level equilibrium where n* = 0. This is where no one 

7 We should really interpret n as the number of people who are expected to consume 
the good, but this distinction won’t be very important for what follows. 
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Figure 

35.1 
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Network externalities. The demand is given by the curved 
hump, the supply by the horizontal line. Note that there are 
three intersections where demand equals supply. 


consumes the good (connects to the network), so no one is willing to pay 
anything to consume the good. This might be referred to as a “pessimistic 
expectations” equilibrium. 

The middle equilibrium with a positive but small number of consumers 
is one where people don’t think the network will be very big, so they aren’t 
willing to pay that much to connect to it—and therefore the network isn’t 
very big. 

Finally the last equilibrium has a large number of people, nn- Here the 
price is small because the marginal person who purchases the good doesn’t 
value it very highly, even though the market is very large. 


35.6 Market Dynamics 


Which of the three equilibria will we see occur? So far the model gives 
us no reason to choose among them. At each of these equilibria, demand 
equals supply. However, we can add a dynamic adjustment process to help 
us decide which equilibrium is more likely to occur. 

It is plausible to assume that when people are willing to pay more than 
the cost of the good, the size of the market expands and, when they are 
willing to pay less, the market contracts. Geometrically this is saying that 
when the demand curve is above the supply curve, the quantity goes up 
and, when it is beneath the supply curve, the quantity goes down. The 
arrows in Figure 35.1 illustrate this adjustment process. 
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These dynamics give us a little more information. It is now evident 
that the low-level equilibrium, where no one connects, and the high-level 
equilibrium, where many people connect, are stable whereas the middle 
equilibrium is unstable. Hence it is unlikely that the final resting point of 
the system will be the middle equilibrium. 

We are now left with two possible stable equilibria; how can we tell which 
is likely to occur? One idea is to think about how costs might change over 
time. For the kinds of examples we have discussed—faxes, VCRs, computer 
networks, and so on -it is natural to suppose that the cost of the good starts 
out high and then decreases over time due to technological progress. This 
process is illustrated in Figure 35.2. At a high unit cost there is only one 
stable equilibrium—where demand equals zero. When the cost decreases 
sufficiently, there are two stable equilibria. 


WILLINGNESS 
TO PAY 


High cost 


Demand curve 


Low cost 


SIZE OF NETWORK 


Cost adjustment and network externalities. When the 
cost is high, the only equilibrium implies a market of size zero. 
As the cost goes down, other equilibria become possible. 


Figure 

35.2 


Now add some noise to the system. Think of perturbing the number of 
people connected to the network around the equilibrium point of n* = 0. 
These perturbations could be random, or they could be part of business 
strategies such as initial discounts or other promotions. As the cost gets 
smaller and smaller, it becomes increasingly likely that one of these pertur¬ 
bations will kick the system up past the unstable equilibrium. When this 
happens, the dynamic adjustment will push the system up to the high-level 
equilibrium. 

A possible path for the number of consumers of the good is depicted in 
Figure 35.3. 













682 INFORMATION TECHNOLOGY (Ch. 35) 


It starts out at essentially zero, with a few small perturbations over time. 
The cost decreases, and at some point we reach a critical mass that kicks 
us up past the low-level equilibrium and the system then zooms up to the 
high-level equilibrium. 


Figure 

35.3 
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Possible adjustment to equilibrium. The number of users 

connected to the network is initially small, and increases only 

gradually as costs fall. When a critical mass is reached, the 

network growth takes off dramatically. 


A real-life example of this kind of adjustment is the market for fax ma¬ 
chines. Figure 35.4 illustrates the price and number of fax machines shipped 
over a period of 12 years. 8 


EXAMPLE: Network Externalities in Computer Software 

Network externalities arise naturally in the provision of computer software. 
It is very convenient to be able to exchange data files and tips with other 


This diagram is taken from “Critical Mass and Network Size with Applications to 
the US Fax Market,” by Nicholas Economides and Charles Himmelberg (Discussion 
Paper no. EC-95-11, Stern School of Business, N.Y.U., 1995). See also Michael 
L. Katz and Carl Shapiro, “Systems Competition and Network Effects,” Journal of 
Economic Perspectives , 8 (1994), 93-116, for a nice overview of network externalities 
and their implications. 
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Figure 

35.4 
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Fax market. The demand for fax machines was small for a long 
time since so few people used them. During the mid-eighties the 
price fell significantly and the demand suddenly exploded. 


users of the same software. This gives a significant advantage to the largest 
seller in a given market and leads software producers to invest heavily in 
acquiring market share. 

Examples of this abound. Adobe Systems for example, invested heavily 
in developing a “page description language” called PostScript for desktop 
publishing. Adobe realized clearly that no one would invest the time and 
resources necessary to learn PostScript unless it was the clear “industry 
standard.” So the firm deliberately allowed competitors to “clone” its 
language in order to create a competitive market in PostScript interpreters. 
Adobe’s strategy paid off: several competitors emerged (including one that 
gave its product away) and PostScript became a widely used standard 
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for desktop publishing. Adobe kept a few things proprietary—for instance, 
techniques for displaying fonts at low resolution—and managed to dominate 
the high end of the market. Ironically, Adobe’s market success was due to 
its ability to encourage entry by its competitors! 

In recent years, many software producers have followed this model. Adobe 
itself gives away several software products, such as the Adobe Acrobat 
reader. One of the hot new stock issues of 1995, Netscape Communica¬ 
tions Corporation, acquired the lion’s share of the Web browser market by 
giving away its main product, making it a prime example of a company 
that “lost money on every sale, but made up for it in volume.” 


35.7 Implications of Network Externalities 

The model described above, simple though it is, still yields a number of 
insights. For example, the critical mass issue is very important: if one user’s 
demand depends on how many other users there are, it is very important 
to try to stimulate growth early in the life cycle of a product. Nowadays it 
is quite common to see producers offering very cheap access to a piece of 
software or a communications service in order to “create a market” where 
none existed before. 

Of course, the critical question is how big does the market have to be 
before it can take off on its own? Theory can provide little guidance here; 
everything depends on the nature of the good and the costs and benefits 
the users face in adopting it. 

Another important implication of network externalities is the role played 
by governmental policy. The Internet is a prime example. The Internet 
was originally used only by a few small research labs to exchange data 
hies. In the mid-eighties the National Science Foundation used the Inter¬ 
net technology to connect several large universities to 12 supercomputers 
deployed at various locations. The original vision was that researchers 
at the universities would send data back and forth to the supercomput¬ 
ers. But a fundamental property of communications networks is that if 
you are all connected to the same thing, you are all connected to each 
other. This allowed researchers to send email to each other that had 
nothing to do with the supercomputers. Once a critical mass of users 
had been connected to the Internet, its value to new users increased dra¬ 
matically. Most of these new users had no interest in the supercomputer 
centers, even though this was the original motivation for providing the 
network. 


EXAMPLE: The Yellow Pages 

The familiar local yellow pages phone directories are a $14 billion business. 


IMPLICATIONS OF NETWORK EXTERNALITIES 685 


Ten years ago, it was dominated by telephone companies, who had about 
95 percent of the market. Nowadays, they have only 85 percent. 

The difference is due to competition. Several small upstarts entered 
the market in recent years, taking business away from the local phone 
companies. This is no easy task, as the local business directories exhibit 
a classic form of network effects: it used to be that consumers all used 
the yellow page directory provided by their local phone companies, so local 
merchants were forced to advertise in them. 

One upstart, Yellow Book, managed to overcome the network effects 
by using clever business strategies, such as dramatically undercutting the 
phone companies’ ad rates and distributing its directory just before the lo¬ 
cal phone company’s directory came out. The incumbent providers, think¬ 
ing that their market was secure, dismissed the threat of aggressive new¬ 
comers until it was nearly too late. In the last few years, competition 
has heated up in this industry. This example goes to show that even in¬ 
dustries with strong network effects aren’t immune to competitive forces, 
particularly when the incumbents become overconfident. 


EXAMPLE: Radio Ads 

The “killer app” for radio in 1910 was ship-to-shore communication. Un¬ 
fortunately, radio conversations were not private since they were broadcast 
to anyone who tuned into the right frequency. At some point David Sarnoff 
recognized that this bug might be a feature and offered a “radio music box” 
that sent music over the airwaves. His colleagues were skeptical, saying, 
“The wireless music box has no imaginable commercial value. Who would 
pay for a message sent to nobody in particular?” 

They had a point. Even though people found broadcast radio attractive, 
the industry did not have a business model. How would they make money? 

The magazine Wireless World held a contest in which it proposed 5 
business models for broadcast radio and people voted for their favorite. 
The business models were: 

• support from general taxation; 

• donations from the public; 

• radio hardware makers subsidized the production of radio content; 

• advertising-supported radio; 

• vacuum tube tax used to support content production. 

The winner was the last model: a tax on vacuum tubes. Some of the 
other models are still used today. BBC Radio and TV is supported by a tax 


686 INFORMATION TECHNOLOGY (Ch. 35) 


on TVs, and National Public Radio in the United States is supported by 
donations from the public at large. However, in most countries advertising 
has become the most popular business model. 

In 1922, 30 radio stations were in operation in the United States, and 
a hundred thousand radios were sold. By the next year there were 556 
stations with half-a-million receivers being sold. Radio was on its way. 


35.8 Two-sided Markets 

A two-sided market is a special kind of network effect. Think about the 
case of a new technology, like Blu-ray DVDs. I don’t really care what sort 
of DVD player other people have, so there is no direct network effect. But 
there is a kind of indirect network effect: the more Blu-ray players that are 
sold, the more disks that will become available, and the more disks there 
are available, the more attractive it will be to buy a Blu-ray player. 

One can think of many other examples. Consider a new credit card: the 
more merchants accept the credit card, the more attractive the credit card 
will be to consumers. But the more consumers who adopt the card, the 
more attractive it will be to merchants. 

Or think about Adobe’s PDF platform. The more users who have the 
PDF-viewing software (Acrobat Reader) the more graphics designers will 
want to distribute content in this format, and the more demand there will 
be for the Acrobat Distiller, the software used to create PDF files. 

This last example illustrates an important point: it may pay Adobe to 
give away one product (Reader) in order to encourage demand for another 
product (Distiller). This is as old as “giving away the razor to sell the 
blades,” but since the combination of digital goods and the Internet has 
made distribution so cheap, the strategy has become very common. 

Apple, for example, sells the popular iPod music player. They also dis¬ 
tribute music for the iPod on their iTunes store. According to industry 
reports, Apple makes very little money on the music -most of the profits 
go to the music studios. However, from Apple’s point of view it makes 
sense to give away the blades (songs) to sell the razors (iPods). 


A Model of Two-sided Markets 

Let us generalize the model used in section 34.5 to apply to two-sided 
markets. 

Suppose now that there are two goods. The reservation price for good 
1 is vi and it takes on values v\ = 1,..., 1000. Similarly the reservation 
price for good 2 takes on values v% = 1,..., 1000. 

The total value for good 1 depends on how many people adopt good 
2, and the total value for good 2 depends on how many people adopt 
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good 1, so we write U\ = uin 2 and U 2 = u 2 ni. Finally, there are some 
exogenous prices for supplying good 1 and good 2, which we denote by pi 
and P 2 ■ (You can think of these as costs from a constant-returns to scale 
production process.) 

The marginal adopters for goods 1 and 2 are determined by f)in 2 = Pi 
and D 2 U 1 = P 2 • Everybody who has a value higher than vi will purchase 
good 1, so ni = 1000 — V\. Similarly, n 2 = 1000 — v 2 . 

Putting all these equations together we have 


vi n 2 = Pi 
V2111 = P2 

ni = 1000 — i)i 
112 = 1000 — V 2 


Substituting from equations (3) and (4) into (1) and (2) we find 


(1000 — ni)ri2 = pi 
(1000 - n 2 )ni = p 2 


The first thing we observe is that there is always an equilibrium at ni = 
n 2 = 0. If no one purchases good 1, the value of good 2 will be zero and vice 
versa. To find the other solutions, we plot the two functions. As you might 
guess, generally there will be two solutions as depicted in the example in 
Figure 35.5. There is a low-level equilibrium where little is sold of either 
good and a high-level equilibrium where there are substantial sales of both 
goods. 

The challenge facing a supplier is how to get to the high-level equilibrium. 
One strategy, mentioned above, is to subsidize the production of one of the 
goods. Selling one good below cost can make sense if it leads to a larger 
market and more profits for other goods you sell. 


35.9 Rights Management 

There is much interest these days in new business models for intellectual 
property (IP). IP transactions take a variety of forms: books are sold out¬ 
right and also borrowed from libraries. Videos can either be sold or rented. 
Some software is licensed for particular uses; other software is sold outright. 
Shareware is a form of software in which payment is voluntary. 

Choosing the terms and conditions under which a piece of intellectual 
property is offered is a critical business decision. Should you use copy 
protection? Should you encourage users to share a news item with a friend? 
Should you sell to individuals or use site license? 

Some simple economics helps to understand the relevant issues. Let’s 
consider a purely digital good, such as an online newspaper, so we don’t 
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Figure 

35.5 



"i 


Equilibrium in a two-sided market. In general there will 
be three equilibria in a two-sided market. 


have to worry about marginal cost of production. First let us consider 
behavior under some default set of terms and conditions. The owner of the 
digital good will choose a price and, implicitly, a quantity to sell so as to 
maximize profit: 

max p(y)y (35.7) 

v 


This yields some optimal ( p*,y *). 

Now the seller of the good contemplates liberalizing terms and conditions: 
let’s say extending a trial period of free use from 1 week to 1 month. This 
has two effects on the demand curve. First, it increases the value of the 
product to each of the potential users, shifting the demand curve up. But 
it also may easily result in less of the item being sold, since some users will 
find the longer trial period enough to meet their needs. 

Let us model this by defining the new amount consumed by Y = by, 
where b > 1, and the new demand curve by P{Y) = ap(Y), where a > 1. 
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The new profit-maximization problem now becomes 

max P{Y)y. 

Note that we multiply price times the amount sold, y, not the amount 
consumed, Y. 

Applying the definitions Y = by and P{Y) = ap(Y), we can write this 
as 

Y a 

max ap(Y )—= max —p(Y)Y. 
y 6 Yb 

This maximization problem looks like problem (35.7) except for the con¬ 
stant a/6 in front of the max. This will not affect the optimal choice, so 
we can conclude that Y* = y*. 

This simple analysis allows us to make several conclusions: 

• The amount of the good consumed, Y*, is independent of the terms and 
conditions. 

• The amount of the good produced is y*/b which is less than y*. 

• The profits could go up or down depending on whether a/6 is greater or 
less than 1. Profits go up if the increase in value to the consumers who 
buy the product compensates for the reduced number of buyers. 


EXAMPLE: Video Rental 

Video stores can choose the terms and conditions under which they rent 
videos. The longer you can keep the video, the more valuable it is to you, 
since you have a longer period of time during which you can watch it. But 
the longer you keep the video, the less profit the store makes from it, since 
it is unable to rent it to someone else. The optimal choice for the rental 
period involves trading off these two effects. 

In practice, this has tended to lead to a form of product differentiation. 
New releases are rented for short periods, since the the profits from other 
renters being excluded are very substantial. Older videos are rented for 
longer periods, since there is less cost to the store from the video being 
unavailable. 


35.10 Sharing Intellectual Property 

Intellectual property is often shared. Libraries, for example, facilitate the 
sharing of books. Video stores help people to “share” videos—and charge 
a price for doing so. Interlibrary loan helps libraries share books among 
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themselves. Even textbooks -such as the one you are holding—are shared 
among students from one term to the next via the resale market. 

There is considerable debate in the publishing and library communities 
about the proper role of sharing. Librarians have established an informal 
“rule of five” for interlibrary loan: an item may be loaned out up to five 
times before additional royalty payments should be made to the publisher. 
Publishers and authors have traditionally been unenthusiastic about the 
resale market for books. 

The advent of digital information has made this situation even more 
acute. Digital information can be perfectly reproduced, and “sharing” can 
be taken to new extremes. Recently, a well-known country music singer 
engaged in a vociferous public relations campaign against stores selling 
used CDs. The problem was that CDs do not deteriorate with replay and 
it is possible to buy a CD, tape it, and then sell the CD to the used-CD 
store. 

Let us try to construct a model of this sort of sharing phenomenon. We 
begin with the baseline case in which there is no sharing. In this case a 
video maker chooses to produce y copies of a video to maximize profit: 


max p(y)y — cy — F. 
v 


(35.8) 


As usual, p(y) is the inverse demand function, c is the (constant) marginal 
cost, and F is the fixed cost. Let the profit maximizing output be denoted 
by y n , where the n stands for “no sharing.” 

Now suppose that a video rental market is allowed. In this case the num¬ 
ber of videos viewed will be distinct from the number of copies produced. 
If y is the number of videos produced and each video is shared among k 
viewers, then the number of viewings will be x = ky. (For simplicity we 
are assuming that all copies of the video are rented in this case.) 

We need to specify how the consumers sort themselves into the “clubs” 
that share the videos. The simplest assumption is that the consumers with 
high values associate with each other, and the consumers with low-values 
associate with each other. That is, one club consists of consumers with the 
k highest values, another club consists of the consumers with the next k 
highest values, and so on. (Other assumptions could be used, but this one 
gives a very simple analysis.) 

If y copies of the video are produced, x = ky copies will be viewed, so the 
willingness to pay of the marginal individual will be p(x) = p{ky). However, 
it is clearly the case that there is some inconvenience cost to renting a video 
rather than owning it yourself. Let us denote this “transactions cost” by t, 
so that the willingness to pay of the marginal individual becomes p{x) — t. 

Recall that we have assumed that all copies of the video are shared 
among k users. Therefore the willingness to pay of a video store will just 
be k times the willingness to pay of the marginal individual. That is, if y 
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copies are produced, the willingness to pay of the video store will be 

P(y) = k[p(ky) - t\. (35.9) 

Equation (35.9) contains the two key effects that arise from sharing: the 
willingness to pay goes down since more videos are viewed than are pro¬ 
duced; but the willingness to pay also goes up since the cost of a single 
video is shared among several individuals. 

The profit maximization problem of the producer now becomes 

max P(y)y - cy - F, 
v 


which can be written as 


max k[p(ky) — t]y — cy — F, 


or 


max p(ky)ky — (— + t) ky — F. 
y \k ) 


Recalling that the number of viewings, x , is related to the number pro¬ 
duced, i/, via x = ky , we can also write the maximization problem as 


max p{x)x 

X 



-F. 


Note that this problem is identical to problem (35.8), with the exception 
that the marginal cost is now (c/fc + t) rather than c. 

The close relationship between the two problems is very useful since it 
allows us to make the following observation: profits will be larger when 
rental is possible than when it is not if and only if 

c 

— T t < c. 
k 

Rearranging this condition, we have 


k \ 

- -- ) t < c. 

k + lj 

For large k , the fraction on the left is about 1. Hence the critical issue 
is the relationship between the marginal cost of production, c, and the 
transactions cost of renting, t. 

If the cost of production is large and the cost of renting is small, then 
the most profitable thing for a producer to do is to produce a few copies, 
sell them at a high price, and let the consumers rent. On the other hand, 
if the transactions cost of renting is larger than the cost of production, it is 
more profitable for a producer to have renting prohibited: since renting is 
so inconvenient for the consumers, video stores aren’t willing to pay much 
more for the “shared” videos, and so the producer is better off selling. 
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EXAMPLE: Online Two-sided Markets 

There are several examples of two-sided markets on the Internet. For ex¬ 
ample, eBay serves as a meeting place for those who wish to buy and sell 
collectibles. If you are selling rare coins, for example, you want to offer 
them on a market where there are many potential buyers. Similarly, if you 
are a buyer, you want to go to a market where there are several competing 
sellers. This two-sided network effect tends to lead to a single meeting 
place. In the last several years, eBay has expanded beyond collectibles and 
now sells a variety of mechandise. 

Another set of interesting examples are social networking sites, such as 
Facebook, My Space, Linkedln, and others. Participants want to register 
on sites where their friends have registered. This again leads to a network 
effect—the largest network attracts the most new participants. 

Facebook has seen particularly rapid growth. It was launched in Febru¬ 
ary of 2004 and by December of that year had 1 million active users. In 
September of 2009, Facebook had over 300 million active users worldwide, 
according to statistics from its website. 


Summary 

1. Because information technology works together in systems, it is costly 
to consumers to switch any one component. 

2. If two monopoly providers of complementary products coordinate their 
price setting, then they will both set their prices lower than they would 
than if they set them independently. 

3. This will increase profit for the two monopolists and make consumers 
better off. 

4. There are many ways to achieve this coordination, including merger, 
negotiation, revenue sharing, and commoditization. 

5. In a lock-in equilibrium the discount offered first period is paid for by 
increased prices in future periods. 

6. Network externalities arise when one person’s willingness to pay for a 
good depends on the number of other users of that good. 

7. Models with network externalities typically exhibit multiple equilibria. 
The ultimate outcome often depends on the history of the industry. 

8. Rights management involves a tradeoff between increased value and 
prices versus reduced sales. 
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9. Information goods like books and videos are often rented or shared as 
well as purchased. Rental or purchase can be more profitable depending 
on how transactions costs compare with production costs. 


REVIEW QUESTIONS 


1. If the cost to a customer from switching long-distance carriers is on the 
order of $50, how much should a long-distance carrier be willing to pay to 
acquire a new customer? 

2. Describe how the demand for a word processing package might exhibit 
network externalities. 

3. Suppose that the marginal cost of producing an extra video is zero and 
the transactions cost of renting a video is zero. Does a producer make more 
money by selling the video or by renting it? 


CHAPTER 



PUBLIC GOODS 


In Chapter 34 we argued that for certain kinds of externalities, it was not 
difficult to eliminate the inefficiencies. In the case of a consumption exter¬ 
nality between two people, for example, all one had to do was to ensure that 
initial property rights were clearly specified. People could then trade the 
right to generate the externality in the normal way. In the case of produc¬ 
tion externalities, the market itself provided profit signals to sort out the 
property rights in the most efficient way. In the case of common property, 
assigning property rights to someone would eliminate the inefficiency. 

Unfortunately, not all externalities can be handled in that manner. As 
soon as there are more than two economic agents involved things become 
much more difficult. Suppose, for example, that instead of the two room¬ 
mates examined in the last chapter, we had three roommates—one smoker 
and two nonsmokers. Then the amount of smoke would be a negative 
externality for both of the nonsmokers. 

Let’s suppose that property rights are well defined—say the nonsmokers 
have the right to demand clean air. Just as before, although they have the 
right to clean air, they also have the right to trade some of that clean air 
away in return for appropriate compensation. But now there is a prob¬ 
lem involved—the nonsmokers have to agree among themselves how much 
smoke should be allowed and what the compensation should be. 
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Perhaps one of the nonsmokers is much more sensitive than the other, or 
one of them is much richer than the other. They may have very different 
preferences and resources, and yet they both have to reach some kind of 
agreement to allow for an efficient allocation of smoke. 

Instead of roommates, we can think of inhabitants of a whole country. 
How much pollution should be allowed in the country? If you think that 
reaching an agreement is difficult with only three roommates, imagine what 
it is like with millions of people! 

The smoke externality with three people is an example of a public 
good -a good that must be provided in the same amount to all the af¬ 
fected consumers. In this case the amount of smoke generated will be the 
same for all consumers—each person may value it differently, but they all 
have to face the same amount. 

Many public goods are provided by the government. For example, streets 
and sidewalks are provided by local municipalities. There are a certain 
number and quality of streets in a town, and everyone has that number 
available to use. National defense is another good example; there is one 
level of national defense provided for all the inhabitants of a country. Each 
citizen may value it differently—some may want more, some may want 
less—but they are all provided with the same amount. 

Public goods are an example of a particular kind of consumption exter¬ 
nality: everyone must consume the same amount of the good. They are a 
particularly troublesome kind of externality, for the decentralized market 
solutions that economists are fond of don’t work very well in allocating 
public goods. People can’t purchase different amounts of public defense; 
somehow they have to decide on a common amount. 

The first issue to examine is what the ideal amount of the public good 
should be. Then we’ll discuss some ways that might be used to make social 
decisions about public goods. 


36.1 When to Provide a Public Good? 

Let us start with a simple example. Suppose that there are two roommates, 
1 and 2. They are trying to decide whether or not to purchase a TV. Given 
the size of their apartment, the TV will necessarily go in the living room, 
and both roommates will be able to watch it. Thus it will be a public 
good, rather than a private good. The question is, is it worth it for them 
to acquire the TV? 

Let’s use w\ and W 2 to denote each person’s initial wealth, <?i and g 2 to 
denote each person’s contribution to the TV, and X\ and X2 to denote each 
person’s money left over to spend on private consumption. The budget 
constraints are given by 

xi+gi = wi 

X2 +52 = w 2 . 
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We also suppose that the TV costs c dollars, so that in order to purchase 
it, the sum of the two contributions must be at least c: 

Si + 92 > c. 

This equation summarizes the technology available to provide the public 
good: the roommates can acquire one TV if together they pay the cost c. 

The utility function of person 1 will depend on his or her private con¬ 
sumption, X\ , and the availability of the TV—the public good. We’ll write 
person l’s utility function as u\(x\, G), where G will either be 0, indicating 
no TV, or 1, indicating that a TV is present. Person 2 will have utility 
function u 2 (a ; 2 ;G). Each person’s private consumption has a subscript to 
indicate that the good is consumed by person 1 or person 2, but the public 
good has no subscript. It is “consumed” by both people. Of course, it isn’t 
really consumed in the sense of being “used up”; rather, it is the services 
of the TV that are consumed by the two roommates. 

The roommates may value the services of the TV quite differently. We 
can measure the value that each person places on the TV by asking how 
much each person would be willing to pay to have the TV available. To do 
this, we’ll use the concept of the reservation price, introduced in Chapter 
15. 

The reservation price of person 1 is the maximum amount that person 1 
would be willing to pay to have the TV present. That is, it is that price, 
ri, such that person 1 is just indifferent between paying ri and having 
the TV available, and not having the TV at all. If person 1 pays the 
reservation price and gets the TV, he will have w\ — r\ available for private 
consumption. If he doesn’t get the TV, he will have w\ available for private 
consumption. If he is to be just indifferent between these two alternatives, 
we must have 

ui(w\ - 7 - 1 , 1 ) = u\(wi, 0 ). 

This equation defines the reservation price for person 1—the maximum 
amount that he would be willing to pay to have the TV present. A similar 
equation defines the reservation price for person 2. Note that in general 
the reservation price of each person will depend on that person’s wealth: 
the maximum amount that an individual will be willing to pay will depend 
to some degree on how much that individual is able to pay. 

Recall that an allocation is Pareto efficient if there is no way to make 
both people better off. An allocation is Pareto inefficient if there is some 
way to make both people better off; in this case, we say that a Pareto 
improvement is possible. In the TV problem there are only two sorts of 
allocations that are of interest. One is an allocation where the TV is not 
provided. This allocation takes the simple form {w\, w 2 ,0); that is, each 
person spends his wealth only on his private consumption. 

The other kind of allocation is the one where the public good is provided. 
This will be an allocation of the form (aq, a: 2 ,1), where 


xi = wi - g i 
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X 2 =W 2 - 52 - 

These two equations come from rewriting the budget constraints. They say 
that each individual’s private consumption is determined by the wealth that 
he has left over after making his contribution to the public good. 

Under what conditions should the TV be provided? That is, when is 
there a payment scheme (gi,g 2 ) such that both people will be better off 
having the TV and paying their share than not having the TV? In the 
language of economics, when will it be a Pareto improvement to provide 
the TV? 

It will be a Pareto improvement to provide the allocation (xi,x 2 ,l) if 
both people would be better off having the TV provided than not having 
it provided. This means 


Ml(«Jl,0) < Ui(Xi, 1) 

u 2 (w 2 ,0) < u 2 (x 2 , 1). 

Now use the definition of the reservation prices r± and r 2 and the budget 
constraint to write 


Ui{wi - n, 1) = U\ (wi, 0) < U l(Xi, 1) = Ui{wi - 0i, 1) 
u 2 (w 2 - r 2 , 1) = u 2 (w 2 , 0) < u 2 (x 2 ,1) = u 2 (w 2 - g 2l 1). 

Looking at the left- and the right-hand sides of these inequalities, and 
remembering that more private consumption must increase utility, we can 
conclude that 

w\ - n < W! - gi 
w 2 — r 2 < w 2 — g 2 , 

which in turn implies 

n > gi 

X2 > ff2- 

This is a condition that must be satisfied if an allocation (wi,u;2,0) 
is Pareto inefficient: it must be that the contribution that each person 
is making to the TV is less than his willingness to pay for the TV. If a 
consumer can acquire the good for less than the maximum that he would 
be willing to pay, then the acquisition would be to his benefit. Thus the 
condition that the reservation price exceeds the cost share simply says that 
a Pareto improvement will result when each roommate can acquire the 
services of the TV for less than the maximum that he would be willing to 
pay for it. This is clearly a necessary condition for purchase of the TV to 
be a Pareto improvement. 

If each roommate’s willingness to pay exceeds his cost share, then the 
sum of the willingnesses to pay must be greater than the cost of the TV: 


n + r 2 > gi + g 2 = c. 


( 36 . 1 ) 
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This condition is a sufficient condition for it to be a Pareto improvement 
to provide the TV. If the condition is satisfied, then there will be some 
payment plan such that both people will be made better off by providing 
the public good. If r i + r 2 > c, then the total amount that the roommates 
will be willing to pay is at least as large as the cost of purchase, so they 
can easily find a payment plan ( 31 , 32 ) such that r\ > 31 , r 2 > 32 , and 
3 i + 32 = c. This condition is so simple that you might wonder why we 
went through all the detail in deriving it. Well, there are a few subtleties 
involved. 

First, it is important to note that the condition describing when pro¬ 
vision of the public good will be a Pareto improvement only depends on 
each agent’s willingness to pay and on the total cost. If the sum of the 
reservation prices exceeds the cost of the TV, then there will always exist a 
payment scheme such that both people will be better off having the public 
good than not having it. 

Second, whether or not it is Pareto efficient to provide the public good 
will, in general, depend on the initial distribution of wealth ( 101 , 102 ). This 
is true because, in general, the reservation prices r 1 and r 2 will depend on 
the distribution of wealth. It perfectly possible that for some distributions 
of wealth rq + r 2 > c, and for other distributions of wealth ri + r 2 < c. 

To see how this can be, imagine a situation where one roommate really 
loves the TV and the other roommate is nearly indifferent about acquiring 
it. Then if the TV-loving roommate had all of the wealth, he would be 
willing to pay more than the cost of the TV all by himself. Thus it would be 
a Pareto improvement to provide the TV. But if the indifferent roommate 
had all of the wealth, then the TV lover wouldn’t have much money to 
contribute toward the TV, and it would be Pareto efficient not to provide 
the TV. 

Thus, in general, whether or not the public good should be provided will 
depend on the distribution of wealth. But in specific cases the provision of 
the public good may be independent of the distribution of wealth. For ex¬ 
ample, suppose that the preferences of the two roommates were quasilinear. 
This means that the utility functions take the form 

u\(xi , G) = x\ + v\ ( G) 

U 2 (X2,G) = x 2 + v 2 (G), 

where G will be 0 or 1, depending on whether or not the public good is 
available. For simplicity, suppose that rq(0) = u 2 (0) = 0. This says that 
no TV provides zero utility from watching TV . 1 

In this case the definitions of the reservation prices become 

ui(wi - ri,l) =w 1 -r 1 + V\ (1) = ui(wi, 0) = wi 
u 2 {w 2 - 7-2,1) = w 2 - r 2 + u 2 (l) = u 2 (w 2 , 0) = w 2 , 


1 


Perhaps watching TV should be assigned a negative utility. 
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which implies that the reservation prices are given by 


ri = Ui(l) 
r 2 = v 2 (l). 


Thus the reservation prices are independent of the amount of wealth, and 
hence the optimal provision of the public good will be independent of 
wealth, at least over some range of wealths. 2 


36.2 Private Provision of the Public Good 

We have seen above that acquiring the TV will be Pareto efficient for the 
two roommates if the sum of their willingnesses to pay exceeds the cost of 
providing the public good. This answers the question about efficient allo¬ 
cation of the good, but it does not necessarily follow that they will actually 
decide to acquire the TV. Whether they actually decide to acquire the TV 
depends on the particular method they adopt to make joint decisions. 

If the two roommates cooperate and truthfully reveal how much they 
value the TV, then it should not be difficult for them to agree on whether 
or not they should buy the TV. But under some circumstances, they may 
not have incentives to tell the truth about their values. 

For example, suppose that each person valued the TV equally, and that 
each person’s reservation price was greater than the cost, so that r\ > c 
and r 2 > c. Then person 1 might think that if he said he had 0 value for 
the TV, the other person would acquire it anyway. But person 2 could 
reason the same way! One can imagine other situations where both people 
would refuse to contribute in the hopes that the other person would go out 
and unilaterally purchase the TV. 

In this kind of situation, economists say that the people are attempting 
to free ride on each other: each person hopes that the other person will 
purchase the public good on his own. Since each person will have full use 
of the services of the TV if it is acquired, each person has an incentive to 
try to pay as little as possible toward the provision of the TV. 


36.3 Free Riding 


Free riding is similar, but not identical, to the prisoner’s dilemma that we 
examined in Chapter 28. To see this, let us construct a numerical example 
of the TV problem described above. Suppose that each person has a wealth 
of $500, that each person values the TV at $100, and that the cost of the 


2 Even this will only be true for some ranges of wealth, since we must always require 
that ri < uii and r 2 < u’ 2 i.e., the willingness to pay is less than the ability to pay. 
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TV is $150. Since the sum of the reservation prices exceeds the cost, it is 
Pareto efficient to buy the TV. 

Let us suppose that there is no way for one of the roommates to exclude 
the other one from watching the TV and that each roommate will decide 
independently whether or not to buy the TV. Consider the decision of one 
of the roommates, Player A. If he buys the TV, he gets benefits of $100 
and pays a cost of $150, leaving him with net benefits of —50. However, if 
Player A buys the TV, Player B gets to watch it for free, which gives B a 
benefit of $100. The payoffs to the game are depicted in Table 36.1. 


Table 

36.1 


Free riding game matrix. 



" 

Player B 

Buy 

Don't buy 

Player A 

Buy 

-50, 

-50, 1C 

Don't buy 

100, -50 

0, 0 


The dominant strategy equilibrium for this game is for neither player to 
buy the TV. If player A decides to buy the TV, then it is in player B’s 
interest to free ride: to watch the TV but not contribute anything to paying 
for it. If player A decides not to buy, then it is in player B’s interest not 
to buy the TV either. This is similar to the prisoners’ dilemma, but not 
exactly the same. In the prisoners’ dilemma, the strategy that maximizes 
the sum of the players’ utilities is for each player to make the same choice. 
Here the strategy that maximizes the sum of the utilities is for just one of 
the players to buy the TV (and both players to watch it). 

If Player A buys the TV and both players watch it, we can construct a 
Pareto improvement simply by having Player B make a “sidepayment” to 
Player A. For example, if Player B gives Player A $51, then both players 
will be made better off when Player A buys the TV. More generally, any 
payment between $50 and $100 will result in a Pareto improvement for this 
example. 

In fact, this is probably what would happen in practice: each player 
would contribute some fraction of the cost of the TV. This public goods 
problem is relatively easy to solve, but more difficult free riding problems 
can arise in the sharing of other household public goods. For example, what 
about cleaning the living room? Each person may prefer to see the living 
room clean and is willing to do his part. But each may also be tempted 
to free ride on the other—so that neither one ends up cleaning the room, 
with the usual untidy results. 
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The situation becomes even worse if there are more than just two people 
involved—since there are more people on whom to free ride! Letting the 
other guy do it may be optimal from an individual point of view, but it is 
Pareto inefficient from the viewpoint of society as a whole. 


36.4 Different Levels of the Public Good 


In the above example, we had an either/or decision: either provide the TV 
or not. But the same kind of phenomena occurs when there is a choice of 
how much of the public good to provide. Suppose, for example, that the 
two roommates have to decide how much money to spend on the TV. The 
more money they decide to spend, the better the TV they can get. 

As before we’ll let X\ and X2 measure the private consumption of each 
person and g\ and <?2 be their contributions to the TV. Let G now measure 
the “quality” of the TV they buy, and let the cost function for quality be 
given by c(G). This means that if the two roommates want to purchase a 
TV of quality G, they have to spend c(G) dollars to do so. 

The constraint facing the roommates is that the total amount that they 
spend on their public and private consumption has to add up to how much 
money they have: 

X\+ X2 + c(G) = W 1 + W 2- 

A Pareto efficient allocation is one where consumer 1 is as well-off as 
possible given consumer 2’s level of utility. If we fix the utility of consumer 
2 at U 2 , we can write this problem as 

max Ui(xi,G ) 

Xi,X2 ,G 

such that rt 2 (x 2 , G) = U 2 
xi + x 2 + c(G) = Wl + w 2 . 

It turns out that the appropriate optimality condition for this problem 
is that the sum of the absolute values of the marginal rates of substitution 
between the private good and the public good for the two consumers equals 
the marginal cost of providing an extra unit of the public good: 

\MRSi\ + \MRS 2 \ = MC(G) 


or, spelling out the definitions of the marginal rates of substitution, 


Ax\ 


AG 


+ 


AX2 


AG 


MU g MU g 


MU x 


MU r 


= AfG(G). 


In order to see why this must be the right efficiency condition, let us 
apply the usual trick and think about what would be the case if it were 
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violated. Suppose, for example, that the sum of the marginal rates of 
substitution were less than the marginal cost: say MC = 1, \MRSi\ = 1/4, 
and \MRS- 2 \ = 1/2. We need to show that there is some way to make both 
people better off. 

Given his marginal rate of substitution, we know that person 1 would be 
willing to accept 1/4 more dollars of the private good for the loss of 1 dollar 
of the public good (since both goods cost $1 per unit). Similarly, person 2 
would accept 1/2 more dollars of the private good for a 1-dollar decrease 
in the public good. Suppose we reduce the amount of the public good and 
offer to compensate both individuals. When we reduce the public good by 
one unit we save a dollar. After we pay each individual the amount he 
requires to allow this change (3/4 = 1/4 + 1/2), we find that we still have 
1/4 of a dollar left over. This remaining money could be shared between 
the two individuals, thereby making them both better off. 

Similarly, if the sum of the marginal rates of substitution were greater 
than 1, we could increase the amount of the public good to make them 
both better off. If \MRS±\ =2/3 and \MRS 2 \ = 1/2, say, this means that 
person 1 would give up 2/3 of a dollar of private consumption to get 1 
unit more of the public good and person 2 would give up 1/2 of a dollar of 
private consumption to get 1 unit more of the public good. But if person 1 
gave up his 2/3 units, and person 2 gave up his 1/2 unit, we would have 
more than enough to produce the extra unit of the public good, since the 
marginal cost of providing the public good is 1. Thus we could give the 
left-over amount back to both people, thereby making them both better off. 

What does the condition for Pareto efficiency mean? One way to inter¬ 
pret it is to think of the marginal rate of substitution as measuring the mar¬ 
ginal willingness to pay for an extra unit of the public good. Then the effi¬ 
ciency condition just says that the sum of the marginal willingnesses to pay 
must equal the marginal cost of providing an extra unit of the public good. 

In the case of a discrete good that was either provided or not provided, 
we said that the efficiency condition was that the sum of the willingnesses 
to pay should be at least as large as the cost. In the case we’re considering 
here, where the public good can be provided at different levels, the efficiency 
condition is that the sum of the marginal willingnesses to pay should equal 
the marginal cost at the optimal amount of the public good. For whenever 
the sum of the marginal willingnesses to pay for the public good exceeds 
the marginal cost, it is appropriate to provide more of the public good. 

It is worthwhile comparing the efficiency condition for a public good 
to the efficiency condition for a private good. For a private good, each 
person’s marginal rate of substitution must equal the marginal cost; for a 
public good, the sum of the marginal rates of substitution must equal the 
marginal cost. In the case of a private good, each person can consume a 
different amount of the private good, but they all must value it the same 
at the margin—otherwise they would want to trade. In the case of a public 
good, each person must consume the same amount of the public good, but 
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they can all value it differently at the margin. 

We can illustrate the public good efficiency condition in Figure 36.1. We 
simply draw each person’s MRS curve and then add them vertically to get 
the sum of the MRS curves. The efficient allocation of the public good will 
occur where the sum of the MRSs equals the marginal cost, as illustrated 
in Figure 36.1. 



Determining the efficient amount of a public good. The 

sum of the marginal rates of substitution must equal the mar¬ 
ginal cost. 


36.5 Quasilinear Preferences and Public Goods 

In general, the optimal amount of the public good will be different at differ¬ 
ent allocations of the private good. But if the consumers have quasilinear 
preferences it turns out that there will be a unique amount of the pub¬ 
lic good supplied at every efficient allocation. The easiest way to see this 
is to think about the kind of utility function that represents quasilinear 
preferences. 

As we saw in Chapter 4, quasilinear preferences have a utility represen¬ 
tation of the form: Ui(xi,G) = Xi + Vi(G). This means that the marginal 


Figure 

36.1 
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utility of the private good is always 1, and thus the marginal rate of substi¬ 
tution between the private and the public good—the ratio of the marginal 
utilities—will depend only on G. In particular: 


\MRSi\ 

\MRS 2 \ 


Au 1 (x 1 ,G)/AG _ Avi(G) 
Aui/Axi AG 

Au 2 (x 2 ,G)/AG _ Av 2 (G) 
Au 2 /Ax 2 AG 


We already know that 
satisfy the condition 


a Pareto efficient level of the public good must 


\MRSil + | MRS 2 \ = MC{G). 


Using the special form of the MRSs in the case of quasilinear utility, we 
can write this condition as 


Aw 1 (G) A v 2 (G) 

AG + AG 


MC(G). 


Note that this equation determines G without any reference to X\ or x 2 . 
Thus there is a unique efficient level of provision of the public good. 

Another way to see this is to think about the behavior of the indifference 
curves. In the case of quasilinear preferences, all of the indifference curves 
are just shifted versions of each other. This means, in particular, that the 
slope of the indifference curves—the marginal rate of substitution—doesn’t 
change as we change the amount of the private good. Suppose that we find 
one efficient allocation of the public and private goods, where the sum of the 
absolute value of the MRSs equals MC(G). Now if we take some amount 
of the private good away from one person and give it to another, the slopes 
of both indifference curves stay the same, so the sum of the absolute value 
of the MRSs is still equal to MC(G) and we have another Pareto efficient 
allocation. 

In the case of quasilinear preferences, all Pareto efficient allocations are 
found by just redistributing the private good. The amount of the public 
good stays fixed at the efficient level. 


EXAMPLE: Pollution Revisited 

Recall the model of the steel firm and the fishery described in Chapter 34. 
There we argued that the efficient provision of pollution was one which 
internalized the pollution costs borne by the steel firm and the fishery. 
Suppose now that there are two fisheries, and that the amount of pollution 
produced by the steel firm is a public good. (Or, perhaps more appropri¬ 
ately, is a public bad!) 
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Then the efficient provision of pollution will involve maximizing the sum 
of the profits of all three firms—that is, minimizing the total social cost of 
the pollution. Formally, let c s (s, x) be the cost to the steel firm of producing 
s units of steel and x units of pollution, and write Cj(fi,x) for the costs 
for firm 1 to catch fi fish when the pollution level is x, and c 2 (f 2 l x) as 
the analogous expression for firm 2. Then to compute the Pareto efficient 
amount of pollution, we maximize the sum of the three firms’ profits: 

max p s s +p/fi +Pfh ~ Cs(s,x) ~ c)(fi,x) - c 2 f (f 2 l x). 

«,/ i,h,x 

The interesting effect for our purposes is the effect on aggregate profits 
of increasing pollution. Increasing pollution lowers the cost of producing 
steel but raises the costs of producing fish for each of the fisheries. The 
appropriate optimality condition from the profit-maximization problem is 

Ac s (s,x) Ac}(/i,x) Ac}(/ 2 ,i) 

As As As 

which simply says that the sum of the marginal costs of pollution over the 
three firms should equal zero. Just as in the case of a public consumption 
good, it is the sum of the marginal benefits or costs over the economic 
agents that is relevant for determining the Pareto efficient provision of a 
public good. 


36.6 The Free Rider Problem 

Now that we know what the Pareto efficient allocations of public goods 
are, we can turn our attention to asking how to get there. In the case of 
private goods with no externalities we saw that the market mechanism will 
generate an efficient allocation. Will the market work in the case of public 
goods? 

We can think of each person as having some endowment of a private 
good, Wi . Each person can spend some fraction of this private good on 
his own private consumption, or he or she can contribute some of it to 
purchase the public good. Let’s use x\ for l’s private consumption, and let 
< 7 i denote the amount of the public good he buys, and similarly for person 2. 
Suppose for simplicity that c(G) = G, which implies that the marginal cost 
of providing a unit of the public good is constant at 1. The total amount 
of the public good provided will be G = g\ + g 2 ■ Since each person cares 
about the total amount of the public good provided, the utility function of 
person i will have the form Ui(xi , <71 + 32 ) = (+, G). 

In order for person 1 to decide how much he should contribute to the 
public good, he has to have some forecast of how much person 2 will con¬ 
tribute. The simplest thing to do here is to adopt the Nash equilibrium 
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model described in Chapter 28, and suppose that person 2 will make some 
contribution g 2 . We assume that person 2 also makes a guess about per¬ 
son l’s contribution, and we look for an equilibrium where each person is 
making an optimal contribution given the other person’s behavior. 

Thus person l’s maximization problem takes the form 

max ui Oi, gi + g 2 ) 

Zl,3l 

such that Xi + gi = W\. 

This is just like an ordinary consumer maximization problem. The op¬ 
timization condition is therefore the same: if both people purchase both 
goods the marginal rate of substitution between the public and the private 
goods should be 1 for each consumer: 

\MRSi \ = 1 

\MRS 2 \ = 1. 

However, we have to be careful here. It is true that if person 2 purchases 
any amount of the public good at all, he will purchase it until the marginal 
rate of substitution equals one. But it can easily happen that person 2 de¬ 
cides that the amount already contributed by person 1 is sufficient and that 
it would therefore be unnecessary for him to contribute anything toward 
the public good at all. 

Formally, we are assuming that the individuals can only make positive 
contributions to the public good—they can put money into the collection 
plate, but they can’t take money out. Thus there is an extra constraint 
on each person’s contributions, namely, that g\ > 0 and g 2 > 0. Each 
person can only decide whether or not he wants to increase the amount 
of the public good. But then it may well be that one person decides that 
the amount provided by the other is just fine and would prefer to make no 
contribution at all. 

A case like this is depicted in Figure 36.2. Here we have illustrated 
each person’s private consumption on the horizontal axis and his or her 
public consumption on the vertical axis. The “endowment” of each person 
consists of his or her wealth, Wi , along with the amount of the public good 
contribution of the other person—since this is how much of the public good 
will be available if the person in question decides not to contribute. Figure 
36.2A shows a case where person 1 is the only contributor to the public 
good, so that g\ = G. If person 1 contributes G units to the public good, 
then person 2’s endowment will consist of her private wealth, w 2l and the 
amount of the public good G —since person 2 gets to consume the public 
good whether or not she contributes to it. Since person 2 cannot reduce the 
amount of the public good, but can only increase it, her budget constraint 
is the bold line in Figure 36.2B. Given the shape of 2’s indifference curve, 
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36.2 


it is optimal from her point of view to free ride on l’s contribution and 
simply consume her endowment, as depicted. 

This is an example where person 2 is free riding on person l’s contribu¬ 
tion to the public good. Since a public good is a good that everyone must 
consume in the same amount, the provision of a public good by any one per¬ 
son will tend to reduce the other peoples’ provision. Thus in general there 
will be too little of the public good supplied in a voluntary equilibrium, 
relative to an efficient provision of the public good. 


36.7 Comparison to Private Goods 

In our discussion of private goods, we were able to show that a particu¬ 
lar social institution—the competitive market—was capable of achieving 
a Pareto efficient allocation of private goods. Each consumer deciding for 
himself or herself how much to purchase of various goods would result in a 
pattern of consumption that was Pareto efficient. A major assumption in 
this analysis was that an individual’s consumption did not affect other peo¬ 
ple’s utility—that is, that there were no consumption externalities. Thus 
each person optimizing with respect to his or her own consumption was 
sufficient to achieve a kind of social optimum. 

The situation is radically different with respect to public goods. In this 
case, the utilities of the individuals are inexorably linked since everyone 
is required to consume the same amount of the public good. In this case 
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the market provision of public goods would be very unlikely to result in a 
Pareto efficient provision. 

Indeed, for the most part we use different social institutions to determine 
the provision of public goods. Sometimes people use a command mech¬ 
anism, where one person or small group of people determines the amount 
of various public goods that will be provided by the populace. Other times 
people use a voting system where individuals vote on the provision of 
public goods. One can well ask the same sorts of questions about voting, 
or other social mechanisms for decision making, that we asked about the 
private market: are they capable of achieving a Pareto efficient allocation 
of public goods? Can any Pareto efficient allocation of public goods be 
achieved by such mechanisms? A complete analysis of these questions is 
beyond the scope of this book, but we will be able to shed a little light on 
how some methods work below. 


36.8 Voting 

Private provision of a public good doesn’t work very well, but there are 
several other mechanisms for social choice. One of the most common mech¬ 
anisms in democratic countries is voting. Let’s examine how well it works 
for the provision of public goods. 

Voting isn’t very interesting in the case of two consumers, so we will sup¬ 
pose that we have n consumers. Furthermore, so as not to worry about ties, 
we’ll suppose that n is an odd number. Let’s imagine that the consumers 
are voting about the size of some public good—say the magnitude of ex¬ 
penditures on public defense. Each consumer has a most-preferred level of 
expenditure, and his valuation of other levels of expenditure depends on 
how close they are to his preferred level of expenditure. 

The first problem with voting as a way of determining social outcomes 
has already been examined in Chapter 33. Suppose that we are considering 
three levels of expenditure, A, B, and C. It is perfectly possible that there 
is a majority of the consumers who prefer A to B, a majority who prefer B 
to C ... and a majority who prefer C to A! 

Using the terminology of Chapter 33, the social preferences generated by 
these consumers are not transitive. This means that the outcome of voting 
on the level of public good may not be well defined—there is always a level 
of expenditure that beats every expenditure. If a society is allowed to vote 
many times on an issue, this means that it may “cycle” around various 
choices. Or if a society votes only once on an issue, the outcome depends 
on the order in which the choices are presented. 

If first you vote on A versus B and then on A versus C, C will be the 
outcome. But if you vote on C versus A and then C versus B, B will be 
the outcome. You can get any of the three outcomes by choosing how the 
alternatives are presented! 
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The “paradox of voting” described above is disturbing. One natural 
thing to do is to ask what restrictions on preferences will allow us to rule 
it out; that is, what form must preferences have so as to ensure that the 
kinds of cycles described above cannot happen? 

Let us depict the preferences of consumer i by a graph like those in 
Figure 36.3, where the height of the graph illustrates the value or the net 
utility for different levels of the expenditure on the public good. The term 
“net utility” is appropriate since each person cares both about the level of 
the public good, and the amount that he has to contribute to it. Higher 
levels of expenditure mean more public goods but also higher taxes in order 
to pay for those public goods. Thus it is reasonable to assume that the net 
utility of expenditure on the public good rises at first due to the benefits 
of the public good but then eventually falls, due to the costs of providing 
it. 

One restriction on preferences of this sort is that they be single-peaked. 
This means that preferences must have the shape depicted in Figure 36.3A 
rather than that depicted in Figure 36.3B. With single-peaked preferences, 
the net utility of different levels of expenditure rises until the most-preferred 
point and then falls, as it does in Figure 36.3A; it never goes up, down, 
and then up again, as it does in Figure 36.3B. 


NET 

UTILITY 


NET 

UTILITY 


Most preferred 
expenditure 


EXPENDITURE EXPENDITURE 


A 


B 


Shapes of preferences. Single-peaked preferences are shown 
in panel A and multiple peaked preferences in panel B. 


Figure 

36.3 


If each individual has single-peaked preferences, then it can be shown 
that the social preferences revealed by majority vote will never exhibit the 
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kind of intransitivity we described above. Accepting this result for the mo¬ 
ment, we can ask which level of expenditure will be chosen if everyone has 
single-peaked preferences. The answer turns out to be the median ex¬ 
penditure —that expenditure such that one-half of the population wants 
to spend more, and one-half wants to spend less. This result is reasonably 
intuitive: if more than one-half wanted more expenditure on the public 
good, they would vote for more, so the only possible equilibrium voting 
outcome is when the votes for increasing and decreasing expenditure on 
the public good are just balanced. 

Will this be an efficient level of the public good? In general, the answer is 
no. The median outcome just means that half the population wants more 
and half wants less; it doesn’t say anything about how much more they 
want of the public good. Since efficiency takes this kind of information 
into account, voting will not in general lead to an efficient outcome. 

Furthermore, even if peoples’ true preferences are single-peaked, so that 
voting may lead to a reasonable outcome, individuals may choose to mis¬ 
represent their true preferences when they vote. Thus people will have an 
incentive to vote differently than their true preferences would indicate in 
order to manipulate the final outcome. 


EXAMPLE: Agenda Manipulation 

We have seen that the outcome of a sequence of votes may depend on the 
order in which the votes are taken. Experienced politicians are well aware 
of this possibility. In the U.S. Congress, amendments to a bill must be 
voted on before the bill itself, and this provides a commonly used way to 
influence the legislative process. 

In 1956 the House of Representatives considered a bill calling for Fed¬ 
eral aid to school construction. One representative offered an amendment 
requiring that the bill would only provide Federal aid to states with in¬ 
tegrated schools. There were three more-or-less equally sized groups of 
representatives with strongly held views on this issue. 

• Republicans. They were opposed to Federal aid to education, but pre¬ 
ferred the amended bill to the original. Their ranking of the alternatives 
was no bill, amended bill, original bill. 

• Northern Democrats. They wanted Federal aid to education and sup¬ 
ported integrated schools, so they ranked the alternatives amended bill, 
original bill, no bill. 

• Southern Democrats. This group wanted Federal aid to education, but 
would not get any aid under the amended bill due to the segregated schools 
in the South. Their ranking was original bill, no bill, amended bill. 
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In the vote on the amendment, the Republicans and the Northern Dem¬ 
ocrats were in the majority, thereby substituting the amended bill for the 
original. In the vote on the amended bill, the Republicans and the South¬ 
ern Democrats were in the majority, and the amended bill was defeated. 
However, before being amended the original bill had a majority of the votes! 


36.9 The Vickrey-CIarke-Groves Mechanism 

Let us think about the public good problem in a very general framework. 
The goal is to choose some outcome (for example, whether or not to provide 
a streetlight) so as to maximize the sum of utilities of the agents involved. 
The challenge is to determine just what those individual utility functions 
are, since consumers may not have good incentives to report true values. 

In the simplest case the choice might be a zero-one decision: if x = 1 
the streetlight is built, if x = 0 it is not. In a more general case, the choice 
might be how much of something to provide—how many streetlights, or 
how bright they are, or where they are located. We will use x to represent 
the possible choices, whatever they may be. We suppose that there are n 
agents, and let Ui(x) be the utility of agent i. The goal is to choose x to 
maximize the sum of the agents’ utility, JV Ui(x). 

This would be easy if the decision maker knew the utility functions. Un¬ 
fortunately, in any realistic situation, the decision maker won’t know this. 
And, as we’ve seen, the agents may well have an incentive to misrepresent 
their true utility functions. 

Somewhat surprisingly, there is a clever way to get the agents to tell the 
truth and achieve an efficient outcome. This economic mechanism is 
known as the Vickrey-CIarke-Groves mechanism, or VCG mecha¬ 
nism. 

Groves Mechanism 

We will describe the VCG mechanism in two stages. First, we describe 
what is known as a Groves mechanism. 

1. The center asks each agent i to report how much he is willing to pay to 
have x units of the public good provided. We denote this reported utility 
for x units of the public good by ?y(x). 

2. The center chooses the level of the public good x* that maximizes the 

sum of the reported utilities, R = 1 ri(x). 

3. Each agent i receives a sidepayment which is the sum of the reported 

utilities of everyone else, evaluated at the level of x determined in step 2. 
Denote this sidepayment by Ri = r j( x *)- 
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It turns out that in this mechanism it is a dominant strategy for each 
agent to report his true utility function. To see why, consider the total 
payoff to agent i, which is his utility plus his sidepayment 


Ui{x) + ^2,rj(x). 


Note that agent i cares about his true utility function but his sidepayment 
depends on the sum of the others’ reported utility functions. 

Agent i recognizes that the decision maker will maximize the sum of 
utilities using his reported utility, 


ri(x) + ^2 r j(x). 


However, agent i wants the decision maker to maximize his own true utility 
plus sidepayment, 





Agent i can ensure that the decision maker makes a choice that will maxi¬ 
mize this expression by reporting his true utility; that is, by setting 7y(x) = 
Ui(x). 

The Groves mechanism essentially “internalizes the externality” among 
the agents. It makes each agent face the costs and benefits that his report 
imposes on the other agents. Each agent wants to report his true utility, 
since that is what he wants to be maximized. 


The VCG Mechanism 

The trouble with the Groves mechanism alone is it is potentially very costly: 
the center has to pay every agent an amount equal to the sum of the others’ 
reported utilities. How can the magnitude of the sidepayments be reduced? 

One important observation is that we can impose a “tax” on each agent 
as long as this tax is independent of the agent’s choice. If the tax is 
independent of i’s choice, then it can’t affect his decision. 3 We will choose 
the tax in a way that guarantees that the net payments the center will 
receive are either positive or zero. Thus the center will always have at least 
as much money as necessary to pay for the public good. 

A particularly convenient tax is to charge agent i an amount equal to the 
maximum sum of reported utilities excluding agent i. That is, we charge 

3 This is where the quasilinear assumption about utility is important. 
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each agent the sum of the reported utilities that would occur if he were not 
present. The net tax imposed on agent i is then 







Note that this number is either positive or zero. Why? Because the max¬ 
imum sum of the n — 1 reported utilities has to be larger than any other 
value for that sum. 

What we are computing here is the difference between what would hap¬ 
pen with agent i present, and what would happen with him absent. Thus 
it measures the net cost that agent i imposes on the other agents. As 
long as i faces the cost that he imposes on the other agents, he will have 
appropriate incentives to report his true utility. 

Now we can complete the description of the VCG mechanism. We use 
steps 1 and 2 above, but then substitute the following steps for step 3 
above. 

3. The center also calculates the outcome that maximizes the sum of the 
n — 1 reported utilities if agent 1,2were not present. Let Wj be the 
maximum sum of reported utilities that results without agent i. 

4. Each agent i pays a tax equal to Wi — Ri. 


36.10 Examples of VCG 


The discussion in the last section was admittedly abstract, so it is helpful 
to examine some specific cases. 

Vickrey Auction 

The first case we look at is the Vickrey auction, as described in Chapter 
17. Here the outcome is simple: which person should get the item being 
auctioned. Let v\ > V 2 be the true values of two bidders and ri > r 2 be 
the reported values. 

If agent 1 is present, he gets a utility of v\. If he is absent, the item is 
awarded to the other agent so agent l’s total payoff is V\ — r 2 . Agent 2 gets 
a payoff of zero no matter what. Each agent has an incentive to report its 
true value, so we end up with the optimal outcome. 

Clarke-Groves Mechanism 

The next example is a public goods problem along the lines of the TV- 
buying game described in Table 36.1. As in that example, suppose that 
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there are two roommates who are trying to decide whether they will buy 
a TV. Let c* be how much agent i will pay if the TV is purchased. Since 
the total cost of the TV is $150, we must have Ci + c 2 = 150. 

According to the VCG mechanism, each agent reports a value for the 
TV, denoted by ry. If rq + r 2 > 150 the TV will be purchased and the 
agents will make the payments according to the mechanism. Let x = 1 if 
the TV is purchased and x = 0 if it is not. 

Before we look at the VCG mechanism, let us think about what would 
happen if we followed a naive mechanism: ask each agent to report his 
value and then acquire the TV if the sum of the reported values exceeds 
the cost of the TV. 

Suppose person l’s value exceeds his cost share, so that tq — Ci > 0. 
Then person 1 may as well report a million dollars; this will ensure the TV 
get purchased, which is what he wants to see. On the other hand if tq < c\ 
person 1 may as well report a negative million dollars. 

The problem is that each agent, acting independently, has no reason 
to take into account the other agent’s values. The agents have a strong 
incentive to exaggerate their reported values one way or the other. 

Let’s see how the VCG mechanism solves this problem. The payoff to 
agent 1 is 

(rq — ci)x + (r 2 — c 2 )x — max y (r 2 — c 2 )y. 

The first term is his net utility from the TV: the value to him minus the 
cost he has to pay. The second term is the reported net utility to his 
roommate. The last term is the maximum utility his roommate would get 
if agent 1 were not present. Since agent 1 can’t influence this, we can just 
ignore it for now. 

Rearranging the first 2 terms we have agent’s payoff as 
[(rq + r 2 ) - (ci + c 2 )]x. 

If this is positive then he can ensure the TV is purchased if he reports 
rq = rq, since then the sum of the reported values will exceed the total 
cost. If this is negative he can ensure that the TV is not purchased by 
reporting rq = rq. Either way, it is optimal to report the true value. The 
same thing is true for agent 2. If both report the truth, the TV will be 
purchased only when rq + v 2 > 150, which is the optimal thing to do. 

Note that agent i will have to make a payment only if he changes the 
social decision. In this case we say agent i is pivotal. The amount of the 
payment a pivotal agent makes is simply the cost that he imposes on the 
other agents. 


36.11 Problems with the VCG 

The VCG mechanism leads to truthtelling and leads to the optimal level 
of the public good. However, it is not without problems. 
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The first problem is that it only works with quasilinear preferences. This 
is because we can’t have the amount that you have to pay influence your 
demand for the public good. It is important that there is a unique optimal 
level of the public good. 

The second problem is that the VCG mechanism doesn’t really generate 
a Pareto efficient outcome. The level of the public good will be optimal, 
but the private consumption could be greater. This is because of the tax 
collection. Remember that in order to have the correct incentives, the 
pivotal people must actually pay some taxes that reflect the harm that 
they do to the other people. And these taxes cannot go to anybody else 
involved in the decision process, since that might affect their decisions. The 
taxes have to disappear from the system. And that’s the problem—if the 
taxes actually have to be paid, the private consumption will end up being 
lower than it could be otherwise, and therefore be Pareto inefficient. 

However, the taxes only have to be paid if someone is pivotal. If there are 
many people involved in the decision, the probability that any one person 
is pivotal may not be very large; thus the tax collections might typically 
be expected to be rather small. 

A third problem with VCG is that it is susceptible to collusion. Consider, 
for example, the public goods problem described above. Suppose that there 
are 3 roommates participating in the TV auction, but two of them collude. 
The colluders agree to each state $1 million as their net benefit from the 
TV. This ensures that the TV will be purchased but since neither of the 
agents is pivotal (i.e., neither of the colluding agents changed the decision) 
then neither one has to pay the tax. 

The final problem concerns the equity and efficiency tradeoff inherent in 
the VCG mechanism. Since the payment scheme must be fixed in advance, 
there will generally be situations where some people will be made worse off 
by providing the public good, even though the Pareto efficient amount of 
the public good will be provided. To say that it is Pareto efficient to provide 
the public good is to say that there is some payment scheme under which 
everyone is better off having the public good provided than not having it. 
But this doesn’t mean that for an arbitrary payment scheme everyone will 
be better off. The Clarke tax ensures that if everyone could be better off 
having the good provided, then it will be provided. But that doesn’t imply 
that everyone will actually be better off. 

It would be nice if there were a scheme that determined not only whether 
or not to provide the public good, but also a Pareto efficient way to pay 
for it—that is, a payment plan that makes everyone better off. However, 
it does not appear that such a general plan is available. 


Summary 


1. Public goods are goods for which everyone must “consume” the same 
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amount, such as national defense, air pollution, and so on. 

2. If a public good is to be provided in some fixed amount or not provided 
at all, then a necessary and sufficient condition for provision to be Pareto 
efficient is that the sum of the willingnesses to pay (the reservation prices) 
exceeds the cost of the public good. 

3. If a public good can be provided in a variable amount, then the necessary 
condition for a given amount to be Pareto efficient is that the sum of the 
marginal willingnesses to pay (the marginal rates of substitution) should 
equal the marginal cost. 

4. The free rider problem refers to the temptation of individuals to let oth¬ 
ers provide the public goods. In general, purely individualistic mechanisms 
will not generate the optimal amount of a public good because of the free 
rider problem. 

5. Various collective decision methods have been proposed to determine the 
supply of a public good. Such methods include the command mechanism, 
voting, and the Clarke tax. 


REVIEW QUESTIONS 

1. Suppose that 10 people live on a street and that each of them is willing 
to pay $2 for each extra streetlight, regardless of the number of streetlights 
provided. If the cost of providing x streetlights is given by c(x ) = x 2 , what 
is the Pareto efficient number of streetlights to provide? 


APPENDIX 


Let’s solve the maximization problem that determines the Pareto efficient allo¬ 
cations of the public good: 

max ui(xi,G ) 

xi,X2,G 

such that 112 ( 2 : 2 , G) = U 2 

Xl + X2 + c(G) = uil + 102 - 

We set up the Lagrangian: 

L = ui(xi, G) — A[u 2 (a: 2 ,G) — u 2 \ — n[xi + x 2 + c(G) — wi — w 2 ] 


and differentiate with respect to xi, X 2 , and G to get 


8 L 

dxi 

8 L 

8x2 

OIL 

8 G 


dui(xi, G ) 
dxi 


— = 0 


.du 2 {x 2 ,G) 

— —'A--- — /I — U 


8 x 2 
8 ui(xi, G) 
8 G 


- A 


8 u 2 (x 2 , G) 


- V 


8 c(G) 


8G 


8 G 


= 0 . 
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If we divide the third equation by /t and rearrange, we get 

1 <9ui(a:i,G) A du 2 {x 2 ,G) _ dc{G ) 

At dG /u dG “ dG ' 

Now solve the first equation for fi to get 

_ dui(xi, G) 

^ dxi 

and solve the second equation for /t/A to get 

/t _ du2(x2, G) 

A dx2 

Substitute these two equations into equation (36.2) to find 

dui(xi,G)/dG du 2 {x 2 ,G)/dG dc(G) 

dui(xi,G)/dxi + du 2 {x 2 ,G)/dx 2 dG 


which is just 
as given in the text. 


MRS i + MRS 2 = MC(G) 


(36.2) 










CHAPTER 



ASYMMETRIC 

INFORMATION 


So far in our study of markets we have not examined the problems raised 
by differences in information: by assumption buyers and sellers were both 
perfectly informed about the quality of the goods being sold in the market. 
This assumption can be defended if it is easy to verify the quality of an 
item. If it is not costly to tell which goods are high-quality goods and which 
are low-quality goods, then the prices of the goods will simply adjust to 
reflect the quality differences. 

But if information about quality is costly to obtain, then it is no longer 
plausible that buyers and sellers have the same information about goods 
involved in transactions. There are certainly many markets in the real 
world in which it may be very costly or even impossible to gain accurate 
information about the quality of the goods being sold. 

One obvious example is the labor market. In the simple models described 
earlier, labor was a homogeneous product—everyone had the same “kind” 
of labor and supplied the same amount of effort per hour worked. This is 
clearly a drastic simplification! In reality, it may be very difficult for a firm 
to determine how productive its employees are. 

Costly information is not just a problem with labor markets. Similar 
problems arise in markets for consumer products. When a consumer buys 
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a used car it may be very difficult for him to determine whether or not it is 
a good car or a lemon. By contrast, the seller of the used car probably has a 
pretty good idea of the quality of the car. We will see that this asymmetric 
information may cause significant problems with the efficient functioning 
of a market. 


37.1 The Market for Lemons 

Let us look at a model of a market where the demanders and suppliers have 
different information about the qualities of the goods being sold. 1 

Consider a market with 100 people who want to sell their used cars and 
100 people who want to buy a used car. Everyone knows that 50 of the cars 
are “plums” and 50 are “lemons.” 2 The current owner of each car knows 
its quality, but the prospective purchasers don’t know whether any given 
car is a plum or a lemon. 

The owner of a lemon is willing to part with it for $1000 and the owner 
of a plum is willing to part with it for $2000. The buyers of the car are 
willing to pay $2400 for a plum and $1200 for a lemon. 

If it is easy to verify the quality of the cars there will be no problems in 
this market. The lemons will sell at some price between $1000 and $1200 
and the plums will sell at some price between $2000 and $2400. But what 
happens to the market if the buyers can’t observe the quality of the car? 

In this case the buyers have to guess about how much each car is worth. 
We’ll make a simple assumption about the form that this guess takes: we 
assume that if a car is equally likely to be a plum as a lemon, then a 
typical buyer would be willing to pay the expected value of the car. Using 
the numbers described above this means that the buyer would be willing 
to pay ±1200 + ±2400 = $1800. 

But who would be willing to sell their car at that price? The owners of 
the lemons certainly would, but the owners of the plums wouldn’t be willing 
to sell their cars—by assumption they need at least $2000 to part with their 
cars. The price that the buyers are willing to pay for an “average” car is 
less than the price that the sellers of the plums want in order to part with 
their cars. At a price of $1800 only lemons would be offered for sale. 

But if the buyer was certain that he would get a lemon, then he wouldn’t 
be willing to pay $1800 for it! In fact, the equilibrium price in this market 
would have to be somewhere between $1000 and $1200. For a price in this 
range only owners of lemons would offer their cars for sale, and buyers 


1 The first paper to point out some of the difficulties in markets of this sort was George 
Akerlof, “The Market for Lemons: Quality Uncertainty and the Market Mechanism,” 
The Quarterly Journal of Economics , 84, 1970, pp. 488-500. He was awarded the 
2001 Nobel Prize in economics for this work. 

2 A “plum” is slang for a good car; a “lemon” is slang for a bad car. 
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would therefore (correctly) expect to get a lemon. In this market, none of 
the plums ever get sold! Even though the price at which buyers are willing 
to buy plums exceeds the price at which sellers are willing to sell them, no 
such transactions will take place. 

It is worth contemplating the source of this market failure. The problem 
is that there is an externality between the sellers of good cars and bad 
cars; when an individual decides to try to sell a bad car, he affects the 
purchasers’ perceptions of the quality of the average car on the market. 
This lowers the price that they are willing to pay for the average car, and 
thus hurts the people who are trying to sell good cars. It is this externality 
that creates the market failure. 

The cars that are most likely to be offered for sale are the ones that 
people want most to get rid of. The very act of offering to sell something 
sends a signal to the prospective buyer about its quality. If too many 
low-quality items are offered for sale it makes it difficult for the owners of 
high-quality items to sell their products. 


37.2 Quality Choice 

In the lemons model there were a fixed number of cars of each quality. Here 
we consider a variation on that model where quality may be determined 
by the producers. We will show how the equilibrium quality is determined 
in this simple market. 

Suppose that each consumer wants to buy a single umbrella and that 
there are two different qualities available. Consumers value high-quality 
umbrellas at $14 and low-quality umbrellas at $8. It is impossible to tell 
the quality of the umbrellas in the store; this can only be determined after 
a few rainstorms. 

Suppose that some manufacturers produce high-quality umbrellas and 
some produce low-quality umbrellas. Suppose further that both high- 
quality and low-quality umbrellas cost $11.50 to manufacture and that 
the industry is perfectly competitive. What would we expect to be the 
equilibrium quality of umbrellas produced? 

We suppose that consumers judge the quality of the umbrellas available 
in the market by the average quality sold, just as in the case of the lemons 
market. If the fraction of high-quality umbrellas is q , then the consumer 
would be willing to pay p = 14<7 + 8(1 — q ) for an umbrella. 

There are three cases to consider. 

Only low-quality manufacturers produce. In this case then the consumers 
would be willing to pay only $8 for an average umbrella. But it costs $11.50 
to produce an umbrella, so none would be sold. 

Only high-quality manufacturers produce. In this case the producers would 
compete the price of an umbrella down to marginal cost, $11.50. The 


QUALITY CHOICE 721 


consumers are willing to pay $14 for an umbrella, so they would get some 
consumers’ surplus. 

Both qualities are produced. In this case competition ensures that the price 
will be $11.50. The average quality available must therefore have a value 
to the consumer of at least $11.50. This means that we must have 

14g + 8(1 — q) > 11.50. 

The lowest value of q that satisfies this inequality is q = 1/12. This means 
that if 7/12 of the suppliers are high-quality the consumers are just willing 
to pay $11.50 for an umbrella. 

The determination of the equilibrium ratio of high-quality producers is 
depicted in Figure 37.1. The horizontal axis measures q, the fraction of 
high-quality producers. The vertical axis measures the consumers’ willing¬ 
ness to pay for an umbrella if the fraction of high-quality umbrellas offered 
is q. Producers are willing to supply either quality of umbrella at a price of 
$11.50, so the supply conditions are summarized by the colored horizontal 
line at $11.50. 

Consumers are willing to purchase umbrellas only if 14 q + 8(1 — q) > 
11.50; the boundary of this region is illustrated by the dashed line. The 
equilibrium value of q is between 7/12 and 1. 

In this market the equilibrium price is $11.50, but the value of the av¬ 
erage umbrella to a consumer can be anywhere between $11.50 and $14, 
depending on the fraction of high-quality producers. Any value of q be¬ 
tween 1 and 7/12 is an equilibrium. 

However, all of these equilibria are not equivalent from the social point 
of view. The producers get zero producer surplus in all the equilibria, due 
to the assumption of pure competition and constant marginal cost, so we 
only have to examine the consumers’ surplus. Here it is easy to see that 
the higher the average quality, the better off the consumers are. The best 
equilibrium from the viewpoint of the consumers is the one in which only 
the high-quality goods are produced. 


Choosing the Quality 

Now let us change the model a bit. Suppose that each producer can choose 
the quality of umbrella that he produces and that it costs $11.50 to produce 
a high-quality umbrella and $11 to produce a low-quality umbrella. What 
will happen in this case? 

Suppose that the fraction of producers who choose high-quality umbrellas 
is q , where 0 < q < 1. Consider one of these producers. If it behaves 
competitively and believes that it has only a negligible effect on the market 
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Figure 

37.1 



Equilibrium quality. The horizontal line represents the sup¬ 
ply conditions: the market is willing to supply any quality of 
umbrella for $11.50. The slanted line represents the demand 
conditions: consumers are willing to pay more if the average 
quality is higher. The market is in equilibrium if the fraction of 
high-quality producers is at least 7/12. 


price and quality, then it would always want to produce only low-quality 
umbrellas. Since this producer is by assumption only a small part of the 
market, it neglects its influence on the market price and therefore chooses 
to produce the more profitable product. 

But every producer will reason the same way and only low-quality um¬ 
brellas will be produced. But consumers are only willing to pay $8 for a 
low-quality umbrella, so there is no equilibrium. Or, if you will, the only 
equilibrium involves zero production of either quality of umbrella! The pos- 
siblity of low-quality production has destroyed the market for both qualities 
of the good! 


37.3 Adverse Selection 

The phenomenon described in the last section is an example of adverse 
selection. In the model we just examined the low-quality items crowded 
out the high-quality items because of the high cost of acquiring information. 
As we just saw, this adverse selection problem may be so severe that it can 
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completely destroy the market. Let’s consider a few other examples of 
adverse selection. 

Consider first an example from the insurance industry. Suppose that 
an insurance company wants to offer insurance for bicycle theft. They do 
a careful market survey and find that the incident of theft varies widely 
across communities. In some areas there is a high probability that a bicycle 
will be stolen, and in other areas thefts are quite rare. Suppose that the 
insurance company decides to offer the insurance based on the average theft 
rate. What do you think will happen? 

Answer: the insurance company is likely to go broke quickly! Think 
about it. Who is going to buy the insurance at the average rate? Not the 
people in the safe communities—they don’t need much insurance anyway. 
Instead the people in the communities with a high incidence of theft will 
want the insurance—they’re the ones who need it. 

But this means that the insurance claims will mostly be made by the 
consumers who live in the high-risk areas. Rates based on the average 
probability of theft will be a misleading indication of the actual experi¬ 
ence of claims filed with the insurance company. The insurance company 
will not get an unbiased selection of customers; rather they will get an 
adverse selection. In fact the term “adverse selection” was first used in the 
insurance industry to describe just this sort of problem. 

It follows that in order to break even the insurance company must base 
their rates on the “worst-case” forecasts and that consumers with a low, 
but not negligible, risk of bicycle theft will be unwilling to purchase the 
resulting high-priced insurance. 

A similar problem arises with health insurance—insurance companies 
can’t base their rates on the average incidence of health problems in the 
population. They can only base their rates on the average incidence of 
health problems in the group of potential purchasers. But the people who 
want to purchase health insurance the most are the ones who are likely to 
need it the most and thus the rates must reflect this disparity. 

In such a situation it is possible that everyone can be made better off 
by requiring the purchase of insurance that reflects the average risk in the 
population. The high-risk people are better off because they can purchase 
insurance at rates that are lower than the actual risk they face and the low- 
risk people can purchase insurance that is more favorable to them than the 
insurance offered if only high-risk people purchased it. 

A situation like this, where the market equilibrium is dominated by a 
compulsary purchase plan, is quite surprising to most economists. We 
usually think that “more choice is better,” so it is peculiar that restricting 
choice can result in a Pareto improvement. But it should be emphasized 
that this paradoxical result is due to the externality between the low-risk 
and high-risk people. 

In fact there are social institutions that help to solve this market ineffi¬ 
ciency. It is commonly the case that employers offer health plans to their 
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employees as part of the package of fringe benefits. The insurance company 
can base its rates on the averages over the set of employees and is assured 
that all employees must participate in the program, thus eliminating the 
adverse selection. 


37.4 Moral Hazard 

Another interesting problem that arises in the insurance industry is known 
as the moral hazard problem. The term is somewhat peculiar, but the 
phenomenon is not hard to describe. Consider the bicycle-theft insurance 
market again and suppose for simplicity that all of the consumers live in 
areas with identical probabilities of theft, so that there is no problem of 
adverse selection. On the other hand, the probability of theft may be 
affected by the actions taken by the bicycle owners. 

For example, if the bicycle owners don’t bother to lock their bikes or use 
only a flimsy lock, the bicycle is much more likely to be stolen than if they 
use a secure lock. Similar examples arise in other sorts of insurance. In 
the case of health insurance, for example, the consumers are less likely to 
need the insurance if they take actions associated with a healthy lifestyle. 
We will refer to actions that affect the probability that some event occurs 
as taking care. 

When it sets its rates the insurance company has to take into account 
the incentives that the consumers have to take an appropriate amount of 
care. If no insurance is available consumers have an incentive to take the 
maximum possible amount of care. If it is impossible to buy bicycle-theft 
insurance, then all bicyclists would use large expensive locks. In this case 
the individual bears the full cost of his actions and accordingly he wants 
to “invest” in taking care until the marginal benefit from more care just 
equals the marginal cost of doing so. 

But if a consumer can purchase bicycle insurance, then the cost inflicted 
on the individual of having his bicycle stolen is much less. After all, if the 
bicycle is stolen then the person simply has to report it to the insurance 
company and he will get insurance money to replace it. In the extreme 
case, where the insurance company completely reimburses the individual 
for the theft of his bicycle, the individual has no incentive to take care at 
all. This lack of incentive to take care is called moral hazard. 

Note the tradeoff involved: too little insurance means that people bear 
a lot of risk, too much insurance means that people will take inadequate 
care. 

If the amount of care is observable, then there is no problem. The insur¬ 
ance company can base its rates on the amount of care taken. In real life it 
is common for insurance companies to give different rates to businesses that 
have a fire sprinkler system in their building, or to charge smokers different 
rates than nonsmokers for health insurance. In these cases the insurance 


MORAL HAZARD AND ADVERSE SELECTION 725 


firm attempts to discriminate among users depending on the choices they 
have made that influence the probability of damage. 

But insurance companies can’t observe all the relevant actions of those 
they insure. Therefore we will have the tradeoff described above: full 
insurance means too little care will be undertaken because the individuals 
don’t face the full costs of their actions. 

What does this imply about the types of insurance contracts that will 
be offered? In general, the insurance companies will not want to offer the 
consumers “complete” insurance. They will always want the consumer to 
face some part of the risk. This is why most insurance policies include a 
“deductible,” an amount that the insured party has to pay in any claim. 
By making the consumers pay part of a claim, the insurance companies can 
make sure that the consumer always has an incentive to take some amount 
of care. Even though the insurance company would be willing to insure 
a consumer completely if they could verify the amount of care taken, the 
fact that the consumer can choose the amount of care he takes implies that 
the insurance company will not allow the consumer to purchase as much 
insurance as he wants if the company cannot observe the level of care. 

This is also a paradoxical result when compared with the standard mar¬ 
ket analysis. Typically the amount of a good traded in a competitive market 
is determined by the condition that demand equals supply—the marginal 
willingness to pay equals the marginal willingness to sell. In the case of 
moral hazard, a market equilibrium has the property that each consumer 
would like to buy more insurance, and the insurance companies would be 
willing to provide more insurance if the consumers continued to take the 
same amount of care ... but this trade won’t occur because if the consumers 
were able to purchase more insurance they would rationally choose to take 
less care! 


37.5 Moral Hazard and Adverse Selection 


Moral hazard refers to situations where one side of the market can’t observe 
the actions of the other. For this reason it is sometimes called a hidden 
action problem. 

Adverse selection refers to situations where one side of the market can’t 
observe the “type” or quality of the goods on other side of the market. For 
this reason it is sometimes called a hidden information problem. 

Equilibrium in a market involving hidden action typically involves some 
form of rationing- firms would like to provide more than they do, but they 
are unwilling to do so since it will change the incentives of their customers. 
Equilibrium in a market involving hidden information will typically involve 
too little trade taking place because of the externality between the “good” 
and “bad” types. 
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Equilibrium outcomes in this market appear to be inefficient, but one has 
to be careful in making such a claim. The question to ask is “inefficient 
relative to what?” The equilibrium will always be inefficient relative to 
the equilibrium with full information. But this is of little help in making 
policy decisions: if the firms in the industry find it too costly to collect 
more information the government would probably find it too costly as well. 

The real question to ask is whether some sort of governmental interven¬ 
tion in the market could improve efficiency even if the government had the 
same information problems as the firms. 

In the case of hidden action considered above, the answer is usually 
“no.” If the government can’t observe the care taken by the consumers, 
then it can do no better than the insurance companies. Of course the 
government might have other tools at its disposal that are not available to 
the insurance company—it could compel a particular level of care, and it 
could set criminal punishments for those who did not take due care. But if 
the government can only set prices and quantities, then it can do no better 
than the private market can do. 

Similar issues arise in the case of hidden information. We have already 
seen that if the government can compel people of all risk classes to purchase 
insurance, it is possible for everyone to be made better off. This is, on the 
face of it, a good case for intervention. On the other hand, there are costs to 
government intervention as well; economic decisions made by governmental 
decree may not be as cost-effective as those made by private firms. Just 
because there are governmental actions that can improve social welfare 
doesn’t mean that these actions will be taken! 

Furthermore, there may be purely private solutions to the adverse selec¬ 
tion problems. For example, we have already seen how providing health 
insurance as a fringe benefit can help to eliminate the adverse selection 
problem. 


37.6 Signaling 

Recall our model of the used-car market: the owners of the used cars knew 
the quality, but the purchasers had to guess at the quality. We saw that 
this asymmetric information could cause problems in the market; in some 
cases, the adverse selection problem would result in too few transactions 
being made. 

However, the story doesn’t end there. The owners of the good used cars 
have an incentive to try to convey the fact that they have a good car to 
the potential purchasers. They would like to choose actions that signal 
the quality of their car to those who might buy it. 

One sensible signal in this context would be for the owner of a good used 
car to offer a warranty. This would be a promise to pay the purchaser 
some agreed upon amount if the car turned out to be a lemon. Owners of 
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the good used cars can afford to offer such a warranty while the owners of 
the lemons can’t afford this. This is a way for the owners of the good used 
cars to signal that they have good cars. 

In this case signaling helps to make the market perform better. By offer¬ 
ing the warranty—the signal—the sellers of the good cars can distinguish 
themselves from the sellers of the bad used cars. But there are other cases 
where signaling can make a market perform less well. 

Let’s consider a very simplified model of the education market first ex¬ 
amined by Michael Spence . 3 Suppose that we have two types of workers, 
able and unable. The able workers have a marginal product of <22, and the 
unable workers have a marginal product of ai, where 02 > ai. Suppose 
that a fraction b of the workers are able and 1 — b of them are unable. 

For simplicity we assume a linear production function so that the total 
output produced by L2 able workers and L\ unable workers is aiLi + <22-^2 ■ 
We also assume a competitive labor market. 

If worker quality is easily observable, then firms would just offer a wage 
of W2 = a2 to the able workers and of w\ — a\ to the unable workers. That 
is, each worker would be paid his marginal product and we would have an 
efficient equilibrium. 

But what if the firm can’t observe the marginal products? If a firm can’t 
distinguish the types of workers, then the best that it can do is to offer the 
average wage, which is w = (l — b)ai + ba,2- As long as the good and the bad 
workers both agree to work at this wage there is no problem with adverse 
selection. And, given our assumption about the production function, the 
firm produces just as much output and makes just as much profit as it 
would if it could perfectly observe the type of the worker. 

However, suppose now that there is some signal that the workers can 
acquire that will distinguish the two types. For example, suppose that the 
workers can acquire education. Let e\ be the amount of education attained 
by the type 1 workers and e2 the amount attained by the type 2 workers. 
Suppose that the workers have different costs of acquiring education, so 
that the total cost of education for the able workers is C2&2 and the total 
cost of education for the unable workers is ciei. These costs are meant to 
include not only the dollar costs of attending school, but also includes the 
opportunity costs, the costs of the effort required, and so on. 

Now we have two decisions to consider. The workers have to decide how 
much education to acquire and the firms have to decide how much to pay 
workers with different amounts of education. Let us make the extreme 
assumption that the education doesn’t affect worker productivity at all. 
Of course this isn’t true in real life—especially for economics courses—but 
it helps to keep the model simple. 


3 Michael Spence, Market Signaling (Cambridge, Mass: Harvard University Press, 
1974). 
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It turns out that the nature of the equilibrium in this model depends 
crucially on the cost of acquiring education. Suppose that c 2 < C\. This 
says that the marginal cost of acquiring education is less for the able workers 
than the unable workers. Let e* be an education level that satisfies the 
following inequalities: 


a 2 — a 1 
Cl 


< e* < 


a 2 — a\ 
C2 


Given our assumption that a 2 > ai and that c 2 < Ci there must be such 

an e*. 

Now consider the following set of choices: the able workers all acquire 
education level e* and the unable workers all acquire education level 0 , and 
the firm pays workers with education level e* a wage of a 2 and workers 
with less education than this a wage of a±. Note that the choice of the 
education level of a worker perfectly signals his type. 

But is this an equilibrium? Does anyone have an incentive to change 
his or her behavior? Each firm is paying each worker his or her marginal 
product, so the firms have no incentive to do anything differently. The 
only question is whether the workers are behaving rationally given the 
wage schedule they face. 

Would it be in the interest of an unable worker to purchase education 
level e*? The benefit to the worker would be the increase in wages a 2 — ai. 
The cost to the unable worker would be cie*. The benefits are less than 
the costs if 

<22 — <2i < Cie*. 

But we are guaranteed that this condition holds by the choice of e*. Hence 
the unable workers find it optimal to choose a zero educational level. 

Is it actually in the interest of the able workers to acquire the level of 
education e*? The condition for the benefits to exceed the costs is 


a 2 — ai > c 2 e*, 


and this condition also holds due to the choice of e*. 

Hence this pattern of wages is indeed an equilibrium: if each able worker 
chooses education level e* and each unable worker chooses a zero educa¬ 
tional level, then no worker has any reason to change his or her behavior. 
Due to our assumption about the cost differences, the education level of 
a worker can, in equilibrium, serve as a signal of the different productivi¬ 
ties. This type of signaling equilibrium is sometimes called a separating 
equilibrium since the equilibrium involves each type of worker making a 
choice that allows him to separate himself from the other type. 

Another possibility is a pooling equilibrium, in which each type of 
worker makes the same choice. For example, suppose that c 2 > ci, so that 
the able workers have a higher cost of acquiring education than the unable 
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workers. In this case it can be shown that the only equilibrium involves 
the workers all getting paid a wage based on their average ability, and so 
no signaling occurs. 

The separating equilibrium is especially interesting since it is inefficient 
from a social point of view. Each able worker finds it in his interest to pay 
for acquiring the signal, even though it doesn’t change his productivity 
at all. The able workers want to acquire the signal not because it makes 
them any more productive, but just because it distinguishes them from the 
unable workers. Exactly the same amount of output is produced in the 
(separating) signaling equilibrium as would be if there were no signaling 
at all. In this model the acquisition of the signal is a total waste from the 
social point of view. 

It is worth thinking about the nature of this inefficiency. As before, it 
arises because of an externality. If both able and unable workers were paid 
their average product, the wage of the able workers would be depressed 
because of the presence of the unable workers. Thus they would have an 
incentive to invest in signals that will distinguish them from the less able. 
This investment offers a private benefit but no social benefit. 

Of course signaling doesn’t always lead to inefficiencies. Some types of 
signals, such as the used-car warranties described above, help to facilitate 
trade. In that case the equilibrium with signals is preferred to the equilib¬ 
rium without signals. So signaling can make things better or worse; each 
case has to be examined on its own merits. 


EXAMPLE: The Sheepskin Effect 

In the extreme form of the educational signaling model described above 
education has no effect on productivity: the years spent in school serve 
only to signal the fixed ability of an individual. This is obviously an ex¬ 
aggeration: a student with 11 years of schooling almost certainly is more 
productive than one with 10 years of schooling due to the fact that they 
have acquired more useful skills during the additional year. Presumably 
part of the returns to schooling are due to signaling, and part are due to 
the acquisition of useful skills while in school. How can we separate these 
two factors? 

Labor economists who have studied the returns to education have ob¬ 
served the following suggestive fact: the earnings of people who have grad¬ 
uated from high school are much higher than the incomes of people who 
have only completed 3 years of high school. One study found that gradu¬ 
ating from high school increases earnings by 5 to 6 times as much as does 
completing a year in high school that does not result in graduation. The 
same discontinuous jump occurs for people who graduate from college. Ac¬ 
cording to one estimate, the economic return to the 16 th year of schooling 
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are about three times as high as the return to the 15th year of schooling. 4 

If education imparts productive skills, we might well expect that people 
with 11 years of education are paid more than people with 10 years of 
education. What is surprising is that there is a huge jump in earnings 
associated with high school graduation. Economists have termed this the 
sheepskin effect, in reference to the fact that diplomas were often written 
on sheepskins. Presumably, graduation from high school is some kind of 
signal. But what is it a signal of? In the educational signaling model 
described earlier, educational attainment was a signal of ability. Is that 
what high school graduation signals? Or is it something else? 

Andrew Weiss, a Boston University economist, attempted to answer 
these questions. 5 He looked at a set of data describing how workers as¬ 
sembled equipment and was able to obtain a measure of how much output 
they produced in their first month on the job. He found that there was 
a very small effect of education on output: each year of secondary educa¬ 
tion increased a worker’s output by about 1.3 percent. Furthermore, high 
school graduates produced essentially the same amount of output as non¬ 
graduates. Apparently education contributed only a small amount to the 
initial productivity of these workers. 

Weiss then looked at another data set that described various character¬ 
istics of workers in a variety of occupations. He found that high school 
graduates had significantly lower quit and absentee rates than nongrad¬ 
uates. It seems that high school graduates receive higher wages because 
they are more productive—but the reason that they are more productive 
is because they stay with the firm longer and have fewer absences. This 
suggests that the signaling model does give us insight into real-world labor 
markets. However, the actual signal sent by educational attainment is con¬ 
siderably more complex than the simplest version of the signaling model 
suggests. 


37.7 Incentives 

We turn now to a slightly different topic, the study of incentive sys¬ 
tems. As it turns out, our investigation of this topic will naturally involve 
asymmetric information. But it is useful to start with the case of full 
information. 

The central question in the design of incentive systems is “How can I get 
someone to do something for me?” Let’s pose this question in a specific 


4 See Thomas Hungerford and Gary Solon, “Sheepskin Effects in the Returns to Edu¬ 
cation,” Review of Economics and Statistics, 69, 1987, 175—77. 


5 


“High School Graduation, Performance and Wages,” Journal of Political Economy, 
96, 4, 1988, 785-820. 
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context. Suppose that you own a plot of land but you are unable to work 
on the land yourself. So you try to hire someone to do the farming for you. 
What sort of compensation system should you set up? 

One plan might involve paying the worker a lump-sum fee independent of 
how much he produces. But then he would have little incentive to work. In 
general a good incentive plan will make the payment of the worker depend 
in some way on the output he produces. The problem of incentive design is 
to determine exactly how sensitive the payment should be to the produced 
output. 

Let x be the amount of “effort” that the worker expends, and let y = /(x) 
be the amount of output produced; for simplicity we suppose that the price 
of output is 1 so that y also measures the value of the output. Let s(y) 
be the amount that you pay the worker if he produces y dollars worth of 
output. Presumably you would like to choose the function s(y) to maximize 
your profits y — s(y). 

What are the constraints that you face? In order to answer this question 
we have to look at things from the worker’s perspective. 

We assume that the worker finds effort costly, and write c(x) for the 
cost of effort x. We assume that this cost function has the usual shape: 
both total and marginal costs increase as effort increases. The utility of the 
worker who chooses effort level x is then simply s(y) —c(x) = s(/(x)) —c(x). 
The worker may have other alternatives available that give him some utility 
u. This could come from working at other jobs or from not working at all. 
All that is relevant for the design of the incentive scheme is that the utility 
that the worker gets from this job must be at least as great as the utility 
he could get elsewhere. This gives us the participation constraint: 

s(f(x)) - c(x) > u. 

Given this constraint we can determine how much output we can get 
from the worker. You want to induce the worker to choose an effort level 
x that yields you the greatest surplus given the constraint that the worker 
is willing to work for you: 

max /(x) - s(/(x)) 

X 

such that s(/(x)) — c(x) > u. 

In general, you will want the worker to choose x to just satisfy the constraint 
so that s(/(x)) — c(x) = u. Substituting this into the objective function 
we have the unconstrained maximization problem 

max /(x) — c(x) — u. 

X 

But it is easy to solve this problem! Just choose x* so that the marginal 
product equals the marginal cost: 


MP(x*) = MC(x*). 
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Any choice of x* where the marginal benefit is not equal to the marginal 
cost cannot maximize profits. 

This tells us what level of effort the owner wants to achieve; now we have 
to ask what he has to pay the worker to achieve that effort. That is, what 
does the function s(y) have to look like to induce the worker to choose to 
make x* the optimal choice? 

Suppose that you decide that you want to induce the worker to put in 
x* amount of effort. Then you must make it in his interest to do so; that 
is, you must design your incentive scheme s(y) so that the utility from 
choosing to work x* is larger than the utility of worker any other amount 
x. This gives us the constraint 

— c(x*) > s(/( x)) — c(x) for all x. 

This constraint is called the incentive compatibility constraint. It 
simply says that the utility to the worker from choosing x* must be greater 
than the utility of any other choice of effort. 

So we have two conditions that the incentive scheme must satisfy: first, 
it must give total utility to the worker of u, and second, it must make the 
marginal product of effort equal to the marginal cost of effort at the effort 
level x*. There are several ways to do this. 

Rent. The landowner could simply rent the land to the worker for some 
price R , so that the worker gets all the output he produces after he pays 
the owner R. For this scheme 

= f{x) - R. 

If the worker maximizes s(f(x)) — c(x) = f{x) — R — c(x), he will choose 
the effort level where MP(x*) = MC(x *), which is exactly what the owner 
wants. The rental rate R is determined from the participation condition. 
Since the total utility to the worker must be u we have 

f(x*) — c(x*) — R = u : 
which says R = f(x*) — c(x*) — u. 

Wage labor. In this scheme the landowner pays the worker a constant wage 
per unit of effort along with a lump sum K. This means that the incentive 
payment takes the form 

s(x) = wx + K. 

The wage rate w is equal to the marginal product of the worker at the opti¬ 
mal choice x*, MP(x*). The constant K is chosen to just make the worker 
indifferent between working for the landowner and working elsewhere; that 
is, it is chosen to satisfy the participation constraint. 


INCENTIVES 733 


The problem of maximizing s(/(x)) — c( x) then becomes 
max wx + K — c(x), 

X 

which means that the worker will choose x so as to set his marginal cost 
equal to the wage: w = MC(x). Since the wage is MP(x*), this means that 
the optimal choice of the worker will be x* such that MP(x*) = MC(x*) 
which is just what the firm wants. 

Take-it-or-leave-it. In this scheme the landowner pays the worker B* if 
he works x* and zero otherwise. The amount B* is determined by the 
participation constraint B* — c(x*) = u, so B* = u + c(x*). If the worker 
chooses any level of effort x ^ x*, he gets a utility of — c(x). If he chooses 
x* , he gets a utility of u. Hence the optimal choice for the worker is to set 

X = X . 


Each of these schemes is equivalent as far as the analysis goes: each one 
gives the worker a utility of u, and each one gives the worker an incentive to 
work the optimal amount x*. At this level of generality there is no reason 
to choose between them. 

If all of these schemes are optimal, what could a nonoptimal scheme look 
like? Here is an example. 

Sharecropping. In sharecropping the worker and the landowner each get 
some fixed percentage of the output. Suppose that the worker’s share takes 
the form s(x) = af(x) + F, where F is some constant and a < 1. This is 
not an efficient scheme for the problem under consideration. It is easy to 
see why. The worker’s maximization problem is 

max af{x) + F — c( x), 

X 

which means that he would choose a level of effort x where 

aMP(x) = MC(x). 

Such an effort level clearly cannot satisfy the efficiency condition that 
MP(x) = MC(x). 

Here is a way to summarize this analysis. In order to design an efficient 
incentive scheme it is necessary to ensure that the person who makes the 
effort decision is the residual claimant to the output. The way the owner 
can make himself as well off as possible is to make sure that he gets the 
worker to produce the optimal amount of output. This is the output level 
where the marginal product of the worker’s extra effort equals the marginal 
cost of putting forth that effort. It follows that the incentive scheme must 
provide a marginal benefit to the worker equal to his marginal product. 
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EXAMPLE: Voting Rights in the Corporation 

Normally shareholders in a corporation have the right to vote on various 
issues related to the management of the corporation while bondholders do 
not. Why is this? The answer comes from looking at the structure of payoffs 
to stockholders and bondholders. If a corporation produces X dollars of 
profit in a given years, the bondholders have first claim on these profits, 
while the amount that is left over goes to the stockholders. If the total claim 
by the bondholders is B , then the amount that goes to the stockholders is 
X — B. This makes the stockholders the residual claimants—so they have 
an incentive to make X as large as possible. The bondholders on the other 
hand only have an incentive to make sure that X is at least B , since that is 
the most that they are entitled to. Hence giving the stockholders the right 
to make decisions will generally result in larger profits. 


EXAMPLE: Chinese Economic Reforms 

Prior to 1979 Chinese rural communes were organized along orthodox 
Marxist lines. Workers were paid according to a rough estimate of how 
much they contributed to the commune income. Five percent of the com¬ 
mune’s land was set aside for private plots, but peasants were not allowed 
to travel to cities to sell the output from their private farms. All trade had 
to take place through a highly regulated government market. 

At the end of 1978 the Chinese central government instituted a major 
reform in the structure of agriculture, known as the “responsibility system.” 
In the responsibility system, any production in excess of a fixed quota 
was kept by the household and could be sold on private markets. The 
government removed restrictions on private plots and increased the amount 
of land devoted to private farming. By the end of 1984, 97 percent of the 
farmers operated under this responsibility system. 

Note that the structure of the system is very much like the optimal in¬ 
centive mechanism described above: each household makes a lump-sum 
payment to the commune but can keep anything in excess of this quota. 
Hence the marginal incentives for household production are the economi¬ 
cally appropriate ones. 

The effect of this new system on agricultural output was phenomenal: 
between 1978 and 1984, the output of Chinese agriculture increased by over 
61 percent! However, not all of this increase is due to better incentives; at 
the same time these reforms were going on, the Chinese government also 
changed the controlled prices of agricultural goods, and even allowed some 
of these prices to be determined on private markets. 

Three economists attempted to divide the increase in output into the part 
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due to better incentives and the part due to the change in prices. 6 They 
found that over three-fourths of the increase was due to the improvement 
in incentives, and only one-fourth was due to the price reforms. 


37.8 Asymmetric Information 

The above analysis provides some insights about the use of different sorts 
of incentive schemes. For example, it shows that renting the land to a 
worker is better than sharecropping. But this really proves too much. If 
our analysis is a good description of the world, then we would expect to 
see rental or wage labor used in agriculture and never see sharecropping 
used, except by mistake. 

Clearly this isn’t right. Sharecropping has been used for thousands of 
years in some parts of the world, so it is likely that it fulfills some kind of 
need. What have we left out of our model? 

Given the title of this section it is not hard to guess the answer: we’ve 
left out problems involving imperfect information. We assumed that the 
owner of the firm could perfectly observe the effort of the worker. In many 
situations of interest it may be impossible to observe the effort. At best the 
owner may observe some signal of the effort such as the resulting production 
of output. The amount of output produced by a farmer may depend in 
part on his effort, but it may also depend on the weather, the quality of 
the inputs, and many other factors. Because of this kind of “noise,” a 
payment from the owner to the worker based on output will not in general 
be equivalent to a payment based on effort alone. 

This is essentially a problem of asymmetric information: the worker can 
choose his effort level, but the owner cannot perfectly observe it. The 
owner has to guess the effort from the observed output, and the design of 
the optimal incentive scheme has to reflect this inference problem. 

Consider the four incentive schemes described above. What goes wrong 
if effort is not perfectly correlated with output? 


Rent. If the firm rents the technology to the worker, then the worker can 
get all of the output that remains after paying the fixed rental fee. If output 
has a random component, this means that the worker will have to bear all 
the risk from the random factors. If the worker is more risk averse than 
the owner—which is the likely case—this will be inefficient. In general, the 
worker would be willing to give up some of the residual profits in order to 
have a less risky income stream. 


J. McMillan, J. Whalley, and L. Zhu, “The Impact of China’s Economic Reforms 
on Agricultural Productivity Growth,” Journal of Political Economy, 97, 4, 1989, 
781-807. 
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Wage labor. The problem with wage labor is that it requires observation 
of the amount of labor input. The wage has to be based on the effort put 
in to production, not just the hours spent in the firm. If the owner can’t 
observe the amount of labor input, then it will be impossible to implement 
this kind of incentive scheme. 

Take-it-or-leave-it. If the incentive payment is based on the labor input, 
then we have the same problem with this scheme as with wage labor. If the 
payment is based on output, then the scheme involves the worker bearing 
all the risk. Even missing the “target output” by a small amount results 
in a zero payment. 

Sharecropping. This is something of a happy medium. The payment to the 
worker depends in part on observed output, but the worker and the owner 
share the risk of output fluctuations. This gives the worker an incentive to 
produce output but it doesn’t leave him bearing all the risk. 


The introduction of asymmetric information has made a drastic change 
in our evaluation of the incentive methods. If the owner can’t observe effort, 
then wage labor is infeasible. Rent and the take-it-or-leave-it scheme leave 
the worker bearing too much risk. Sharecropping is a compromise between 
the two extremes: it gives the worker some incentive to produce, but it 
doesn’t leave him with all the risk. 


EXAMPLE: Monitoring Costs 

It is not always easy to observe the amount of effort an employee puts into 
his or her job. Consider, for example, a job as a clerk in a 24-hour con¬ 
venience store. How can the manager observe the employees’ performance 
when the manager isn’t around? Even if there are ways to observe the 
physical output of the employee (shelves stocked, sales rung up) it is much 
harder to observe things like politeness to customers. 

There is little doubt that some of the worst service in the world was 
provided in the formerly Communist countries in Eastern Europe: once 
you managed to attract the attention of a clerk, you were more likely to be 
greeted by a scowl than a smile. Nevertheless, a Hungarian entrepreneur, 
Gabor Varszegi, has made millions by providing high-quality service in his 
photo developing shops in Budapest. 7 

Varszegi says that he got his start as a businessman in the mid-sixties 
by playing bass guitar and managing a rock group. “Back then,” he says, 
“the only private businessmen in Eastern Europe were rock musicians.” 


7 


See Steven Greenhouse, “A New Formula in Hungary: Speed Service and Grow Rich, : 
New York Times, June 5, 1990, Al. 
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He introduced one-hour film developing to Hungary in 1985; the next best 
alternative to his one-hour developing shops was the state-run agency that 
took one month. 

Varszegi follows two rules in labor relations: he never hires anyone who 
worked under Communism, and he pays his workers four times the market 
wage. This makes perfect sense in light of the above remarks about mon¬ 
itoring costs: there are very few employees per store and monitoring their 
behavior is very costly. If there were only a small penalty to being fired, 
there would be great temptation to slack off. By paying the workers much 
more than they could get elsewhere, Varszegi makes it very costly for them 
to be fired—and reduces his monitoring costs significantly. 


EXAMPLE: The Crameen Bank 

A village moneylender in Bangladesh charges over 150 percent interest a 
year. Any American banker would love a return of that size: why isn’t 
Citibank installing money machines in Bangladesh? To ask the question is 
to answer it: Citibank would probably not do as well as the moneylender. 
The village moneylender has a comparative advantage in these small-scale 
loans for several reasons. 

• The village moneylender can deal more effectively with the small scale of 
lending involved; 

• The moneylender has better access to information about who are good 
and bad credit risks than an outsider does. 

• The moneylender is in a better position to monitor the progress of the 
loan payments to insure repayment. 

These three problems—returns to scale, adverse selection and moral 
hazard—allow the village moneylender to maintain a local monopoly in 
the credit market. 

Such a local monopoly is especially pernicious in an underdeveloped 
country such as Bangladesh. At an interest rate of 150 percent there are 
many profitable projects that are not being undertaken by the peasants. 
Improved access to credit could lead to a major increase in investment, and 
a corresponding increase in the standards of living. 

Muhammad Yunas, an American-trained economist from Bangladesh, 
has developed an ingenious institution known as the Grameen Bank (vil¬ 
lage bank) to address some of these problems. In the Grameen plan, en¬ 
trepreneurs with separate projects get together and apply for a loan as a 
group. If the loan is approved, two members of the group get their loan 
and commence their investment activity. If they are successful in meeting 
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the repayment schedule, two more members get loans. If they are also 
successful the last member, the group leader, will get a loan. 

The Grameen bank addresses each of the three problems described above. 
Since the quality of the group influences whether or not individual members 
will get loans, potential members are highly selective about who they will 
join with. Since members of the group can only get loans if other members 
succeed in their investments, there are strong incentives to help each other 
out and share expertise. Finally, these activities of choosing candidates for 
loans and monitoring the progress of the repayments are all done by the 
peasants themselves, not directly by the loan officers at the bank. 

The Grameen bank has been very successful. It makes about 475,000 
loans a month with an average size of $70. Their loan-recovery rate is 
about 98 percent, while conventional lenders in Bangladesh achieve a loan- 
recovery rate of about 30 to 40 percent. The success of the group respon¬ 
sibility program in encouraging investment has led to its adoption in a 
number of other poverty-stricken areas in North and South America. 


Summary 

1. Imperfect and asymmetric information can lead to drastic differences in 
the nature of market equilbrium. 

2. Adverse selection refers to situations where the type of the agents is not 
observable so that one side of the market has to guess the type or quality 
of a product based on the behavior of the other side of the market. 

3. In markets involving adverse selection too little trade may take place. 
In this case it is possible that everyone can be made better off by forcing 
them to transact. 

4. Moral hazard refers to a situation where one side of the market can’t 
observe the actions of the other side. 

5. Signaling refers to the fact that when adverse selection or moral hazard 
are present some agents will want to invest in signals that will differentiate 
them from other agents. 

6. Investment in signals may be privately beneficial but publically wasteful. 
On the other hand, investment in signals may help to solve problems due 
to asymmetric information. 

7. Efficient incentive schemes (with perfect observability of effort) leave the 
worker as the residual claimant. This means that the worker will equate 
marginal benefits and marginal costs. 
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8. But if information is imperfect this is no longer true. In general, an 
incentive scheme that shares risks as well as providing incentives will be 
appropriate. 

REVIEW QUESTIONS 


1. Consider the model of the used-car market presented in this chapter. 
What is the maximum amount of consumers’ surplus that is created by 
trade in the market equilibrium? 

2. In the same model, how much consumers’ surplus would be created 
by randomly assigning buyers to sellers? Which method gives the larger 
surplus? 

3. A worker can produce x units of output at a cost of c( x) = x 2 /2. He 
can achieve a utility level of u = 0 working elsewhere. What is the optimal 
wage-labor incentive scheme s(x) for this worker? 

4. Given the setup of the previous problem, what would the worker be 
willing to pay to rent the production technology? 

5. How would your answer to the last problem change if the worker’s alter¬ 
native employment gave him u = 1? 
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MATHEMATICAL 

APPENDIX 


In this Appendix we will provide a brief review of some of the mathematical 
concepts that are used in the text. This material is meant to serve as a 
reminder of the definitions of various terms used in the text. It is emphat¬ 
ically not a tutorial in mathematics. The definitions given will generally 
be the simplest, not the most rigorous. 


A.1 Functions 

A function is a rule that describes a relationship between numbers. For 
each number x, a function assigns a unique number y according to some 
rule. Thus a function can be indicated by describing the rule, as “take a 
number and square it,” or “take a number and multiply it by 2,” and so 
on. We write these particular functions as y = x 2 , y = 2x. Functions are 
sometimes referred to as transformations. 

Often we want to indicate that some variable y depends on some other 
variable x, but we don’t know the specific algebraic relationship between the 
two variables. In this case we write y = /(x), which should be interpreted 
as saying that the variable y depends on x according to the rule /. 

Given a function y = /(x), the number x is often called the indepen¬ 
dent variable, and the number y is often called the dependent variable. 


Figure 
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The idea is that x varies independently, but the value of y depends on the 
value of x. 

Often some variable y depends on several other variables x\, x 2 , and 
so on, so we write y = f(x 1 , 2 : 2 ) to indicate that both variables together 
determine the value of y. 


A.2 Graphs 

A graph of a function depicts the behavior of a function pictorially. Figure 
A.l shows two graphs of functions. In mathematics the independent vari¬ 
able is usually depicted on the horizontal axis, and the dependent variable 
is depicted on the vertical axis. The graph then indicates the relationship 
between the independent and the dependent variables. 

However, in economics it is common to graph functions with the indepen¬ 
dent variable on the vertical axis and the dependent variable on the hori¬ 
zontal axis. Demand functions, for example, are usually depicted with the 
price on the vertical axis and the amount demanded on the horizontal axis. 


y 

y 



/y= 2x 

I 2 

I y = x 

5 

/ 5 


4 

/ 4 


3 

/ 3 


2 

1 

/ 2 

/ 1 

r 11 ! 

1 1 1 1 


1 2 3 4 x 

1 2 3 4 X 


A 

B 

Graphs of functions. Panel A denotes the graph of y — 2x, 

and panel B denotes the graph of y 

= x 2 . 


A.3 Properties of Functions 

A continuous function is one that can be drawn without lifting a pencil 
from the paper: there are no jumps in a continuous function. A smooth 
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function is one that has no “kinks” or comers. A monotonic function 
is one that always increases or always decreases; a positive monotonic 
function always increases as x increases, while a negative monotonic 
function always decreases as x increases. 


A.4 Inverse Functions 

Recall that a function has the property that for each value of x there is 
a unique value of y associated with it and that a monotonic function is 
one that is always increasing or always decreasing. This implies that for a 
monotonic function there will be a unique value of x associated with each 
value of y. 

We call the function that relates x to y in this way an inverse function. 
If you are given y as a function of x , you can calculate the inverse function 
just by solving for x as a function of y. If y = 2x, then the inverse function 
is x = y/2. If y = x 2 , then there is no inverse function; given any y , both 
x = +y/y and x = —^fy have the property that their square is equal to y. 
Thus there is not a unique value of x associated with each value of y , as is 
required by the definition of a function. 


A.5 Equations and Identities 

An equation asks when a function is equal to some particular number. 
Examples of equations are 

2x = 8 
x 2 = 9 
f{x) = 0. 

The solution to an equation is a value of x that satisfies the equation. 
The first equation has a solution of x = 4. The second equation has two 
solutions, x = 3 and x = —3. The third equation is just a general equation. 
We don’t know its solution until we know the actual rule that / stands for, 
but we can denote its solution by x*. This simply means that x* is a number 
such that fix*) = 0. We say that x* satisfies the equation /( x) = 0. 

An identity is a relationship between variables that holds for all values 
of the variables. Here are some examples of identities: 

(x + y) 2 = x 2 + 2xy + y 2 
2{x T 1) = 2x T 2. 

The special symbol = means that the left-hand side and the right-hand 
side are equal for all values of the variables. An equation only holds for 
some values of the variables, whereas an identity is true for all values of the 
variables. Often an identity is true by the definition of the terms involved. 
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A.6 Linear Functions 


A linear function is a function of the form 


y = ax + b, 

where a and b are constants. Examples of linear functions are 


y = 2x + 3 
y = x — 99. 


Strictly speaking, a function of the form y = ax + b should be called an 
affine function, and only functions of the form y = ax should be called 
linear functions. However, we will not insist on this distinction. 

Linear functions can also be expressed implicitly in forms like ax+by = c. 
In such a case, we often like to solve for y as a function of x to convert this 
to the “standard” form: 

c a 


y= b-~b X - 


A.7 Changes and Rates of Change 

The notation Ax is read as “the change in x.” It does not mean A times 
x. If x changes from x* to x**, then the change in x is just 

Ax = x** -x*. 


We can also write 

x** = x* + Ax 

to indicate that x** is x* plus a change in x. 

Typically Ax will refer to a small change in x. We sometimes express 
this by saying that Ax represents a marginal change. 

A rate of change is the ratio of two changes. If y is a function of x 
given by y = /(x), then the rate of change of y with respect to x is denoted 

by 

Ay = f(x + Ax) - /(x) 

Ax Ax 

The rate of change measures how y changes as x changes. 

A linear function has the property that the rate of change of y with 
respect to x is constant. To prove this, note that if y = a + bx, then 

Ay a + b(x + Ax) — a — bx bAx 
Ax Ax Ax 
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For nonlinear functions, the rate of change of the function will depend 
on the value of x. Consider, for example, the function y = x 2 . For this 
function 

Ay (x + Ax) 2 — x 2 x 2 + 2xAx + (Ax) 2 — x 2 

-r— = --^--- = 2x + Ax. 

Ax Ax Ax 

Here the rate of change from x to x + Ax depends on the value of x and 
on the size of the change, Ax. But if we consider very small changes in x, 
Ax will be nearly zero, so the rate of change of y with respect to x will be 
approximately 2x. 


A.8 Slopes and Intercepts 

The rate of change of a function can be interpreted graphically as the 
slope of the function. In Figure A.2A we have depicted a linear function 
y = — 2x + 4. The vertical intercept of this function is the value of y 
when x = 0, which is y = 4. The horizontal intercept is the value of x 
when y = 0, which is x = 2. The slope of the function is the rate of change 
of y as x changes. In this case, the slope of the function is —2. 




Slopes and intercepts. Panel A depicts the function y = 
—2x + 4, and panel B depicts the function y = x 2 . 


Figure 
A.2 


In general, if a linear function has the form y = ax + b, the vertical 
intercept will be y* = b and the horizontal intercept will be x* = —b/a. If 
a linear function is expressed in the form 


aixi + a 2 x 2 = c, 
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then the horizontal intercept will be the value of X\ when X2 = 0, which is 
x\ = cfa \, and the vertical intercept will occur when x\ = 0, which means 
x *2 = c/a- 2 - The slope of this function is — 01 / 02 . 

A nonlinear function has the property that its slope changes as x changes. 
A tangent to a function at some point a; is a linear function that has the 
same slope. In Figure A.2B we have depicted the function x 2 and the 
tangent line at x = 1. 

If y increases whenever x increases, then Ay will always have the same 
sign as Ax, so that the slope of the function will be positive. If on the 
other hand y decreases when x increases, or y increases when x decreases, 
Ay and Ax will have opposite signs, so that the slope of the function will 
be negative. 


A.9 Absolute Values and Logarithms 


The absolute value of a number is a function f(x) defined by the following 
rule: 



if x > 0 
if x < 0. 


Thus the absolute value of a number can be found by dropping the sign of 
the number. The absolute value function is usually written as |x|. 

The (natural) logarithm or log of x describes a particular function of 
x, which we write as y = In* or y = ln(x). The logarithm function is the 
unique function that has the properties 


ln(*y) = ln(x) + ln(y) 


for all positive numbers x and y and 

ln(e) = 1. 

(In this last equation, e is the base of natural logarithms which is equal to 
2.7183...) In words, the log of the product of two numbers is the sum of 
the individual logs. This property implies another important property of 
logarithms: 

ln(* y ) = yln(x), 

which says that the log of x raised to the power y is equal to y times the 
log of x. 


A.10 Derivatives 


The derivative of a function y = /(*) is defined to be 


df(x) 

dx 


lim 

Ax —tO 


f(x + Ax) - f(x) 


Ax 
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In words, the derivative is the limit of the rate of change of y with respect 
to a; as the change in x goes to zero. The derivative gives precise meaning 
to the phrase “the rate of change of y with respect to x for small changes 
in x.” The derivative of f(x) with respect to x is also denoted by /'(x). 

We have already seen that the rate of change of a linear function y = 
ax + b is constant. Thus for this linear function 

df{x) _ 
dx 

For a nonlinear function the rate of change of y with respect to x will 
usually depend on x. We saw that in the case of f(x) = x 2 , we had 
Ay/Ax = 2x + Ax. Applying the definition of the derivative 

df(x) 

—-— = lim 2x + Ax = 2x. 
dx Ai->0 

Thus the derivative of x 2 with respect to x is 2x. 

It can be shown by more advanced methods that if y = lnx, then 

df(x) _ 1 
dx x 


A.1 1 Second Derivatives 


The second derivative of a function is the derivative of the derivative of 
that function. If y = /(x), the second derivative of /(x) with respect to x 
is written as d 2 f(x)/dx 2 or f"(x). We know that 


d( 2x) 
dx 


= 2 


d(x 2 ) 

dx 


= 2x. 


Thus 

d 2 {2x) = d{ 2) = Q 
dx 2 dx 
d 2 [x 2 ) d{ 2x) ^ 

dx 2 dx 

The second derivative measures the curvature of a function. A function 
with a negative second derivative at some point is concave near that point; 
its slope is decreasing. A function with a positive second derivative at a 
point is convex near that point; its slope is increasing. A function with a 
zero second derivative at a point is flat near that point. 
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A.12 The Product Rule and the Chain Rule 

Suppose that g{x) and h(x) are both functions of x. We can define the 
function /(x) that represents their product by /(x) = g(x)h(x). Then the 
derivative of /(x) is given by 

LLJj Hub Unh 

Given two functions y = g{x) and 2 = h(y), the composite function is 

/(x) = h(g(x)). 

For example, if g(x) = x 2 and h(y) = 2y + 3, then the composite function 
is 

/(x) = 2x 2 + 3. 

The chain rule says that the derivative of a composite function, /(x), 
with respect to x is given by 

df( x) _ dh(y) dg(x) 
dx dy dx 

In our example, dh(y)/dy = 2, and dg{x)/dx = 2x, so the chain rule says 
that df(x)/dx = 2 x 2x = 4x. Direct calculation verifies that this is the 
derivative of the function /(x) = 2x 2 + 3. 


A.13 Partial Derivatives 

Suppose that y depends on both X\ and X 2 , so that y = f(x i,X 2 ). Then 
the partial derivative of /(xi,X 2 ) with respect to x\ is defined by 

d/(xi,x 2 ) _ /(x 1 + Axi,x 2 ) - /(xi,x 2 ) 

dx\ Axi— »o Axi 

The partial derivative of /(xi,X 2 ) with respect to x\ is just the derivative 
of the function with respect to Xi, holding X 2 fixed. Similarly, the partial 
derivative with respect to X 2 is 

df(x i,x 2 ) .. /(xi,x 2 + Ax 2 ) - /(xi,x 2 ) 

--- = lim -. 

OX 2 Ax 2 —»o AX 2 

Partial derivatives have exactly the same properties as ordinary derivatives; 
only the name has been changed to protect the innocent (that is, people 
who haven’t seen the d symbol). 
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In particular, partial derivatives obey the chain rule, but with an extra 
twist. Suppose that x\ and X 2 both depend on some variable t and that 
we define the function g(t ) by 

g(t) = f(x 1 (t),x 2 (t)). 

Then the derivative of g(t) with respect to t is given by 

dg(t) _ df(xi,x 2 ) dxijt) df(x u x 2 ) dx 2 {t) 

dt dxi dt dx 2 dt 

When t changes, it affects both X\ (t) and x 2 (t). Therefore, we need to 
calculate the derivative of f(xi,x 2 ) with respect to each of those changes. 


A.14 Optimization 


If y = f(x), then /( x) achieves a maximum at x* if f(x*) > f(x) for 
all x. It can be shown that if f(x) is a smooth function that achieves its 
maximum value at x*, then 


df(x*) 

dx 


d 2 f(x *) 
dx 2 


< 0 . 


These expressions are referred to as the first-order condition and the 
second-order condition for a maximum. The first-order condition says 
that the function is flat at x*, while the second-order condition says that 
the function is concave near x*. Clearly both of these properties have to 
hold if x* is indeed a maximum. 

We say that f(x) achieves its minimum value at x* if f(x*) < /( x) for 
all x. If f(x) is a smooth function that achieves its minimum at x*, then 


df (x*) 
dx 


d 2 f(x *) 
dx 2 


> 0 . 


The first-order condition again says that the function is flat at x*, while 
the second-order condition now says that the function is convex near x*. 

If y = f(xi,x 2 ) is a smooth function that achieves its maximum or 
minimum at some point (x^x^), then we must satisfy 


df( a 


2) 


dxi 

df(x{,x* 2 ) 

dx 2 


= 0 


= 0 . 


These are referred to as the first-order conditions. There are also second- 
order conditions for this problem, but they are more difficult to describe. 
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A.15 Constrained Optimization 

Often we want to consider the maximum or minimum of some function over 
some restricted values of (xi,X2). The notation 

max f(x i, £2) 

X\ ,X2 


such that g(x 1,22) = c. 


means 

find x\ and xj such that /(x*,x2) > f(x 1,^2) for all values of x\ and X2 
that satisfy the equation g(x 1,^2) = c. 

The function f(x 1,^2) is called the objective function, and the equa¬ 
tion g(x 1,^2) = c is called the constraint. Methods for solving this kind 
of constrained maximization problem are described in the Appendix to 
Chapter 5. 


ANSWERS 


1 The Market 

1.1. It would be constant at $500 for 25 apartments and then drop to $200. 

1.2. In the first case, $500, and in the second case, $200. In the third case, 
the equilibrium price would be any price between $200 and $500. 

1.3. Because if we want to rent one more apartment, we have to offer a lower 
price. The number of people who have reservation prices greater than p 
must always increase as p decreases. 

1.4. The price of apartments in the inner ring would go up since demand 
for apartments would not change but supply would decrease. 

1.5. The price of apartments in the inner ring would rise. 

1.6. A tax would undoubtedly reduce the number of apartments supplied 
in the long run. 

1.7. He would set a price of 25 and rent 50 apartments. In the second case 
he would rent all 40 apartments at the maximum price the market would 
bear. This would be given by the solution to D{p) = 100 — 2 p = 40, which 
is p* = 30. 

1.8. Everyone who had a reservation price higher than the equilibrium 
price in the competitive market, so that the final outcome would be Pareto 
efficient. (Of course in the long run there would probably be fewer new 
apartments built, which would lead to another kind of inefficiency.) 


2 Budget Constraint 


2.1. The new budget line is given by 2p\Xi + 8 ^ 2*2 = 4m. 

2.2. The vertical intercept ( X 2 axis) decreases and the horizontal intercept 
(cci axis) stays the same. Thus the budget line becomes flatter. 
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2.3. Flatter. The slope is —2pi/3p2- 

2.4. A good whose price has been set to 1; all other goods’ prices are 
measured relative to the numeraire good’s price. 

2.5. A tax of 8 cents a gallon. 

2 .6. (pi + t)x i + ( P 2 — s)x 2 = m— u. 

2.7. Yes, since all of the bundles the consumer could afford before are 
affordable at the new prices and income. 


3 Preferences 

3.1. No. It might be that the consumer was indifferent between the two 
bundles. All we are justified in concluding is that (x±, X 2 ) A (j/i, 1 / 2 ). 

3.2. Yes to both. 

3.3. It is transitive, but it is not complete—two people might be the same 
height. It is not reflexive since it is false that a person is strictly taller than 
himself. 

3.4. It is transitive, but not complete. What if A were bigger but slower 
than B? Which one would he prefer? 

3.5. Yes. An indifference curve can cross itself, it just can’t cross another 
distinct indifference curve. 

3.6. No, because there are bundles on the indifference curve that have 
strictly more of both goods than other bundles on the (alleged) indifference 
curve. 

3.7. A negative slope. If you give the consumer more anchovies, you’ve 
made him worse off, so you have to take away some pepperoni to get him 
back on his indifference curve. In this case the direction of increasing utility 
is toward the origin. 

3.8. Because the consumer weakly prefers the weighted average of two bun¬ 
dles to either bundle. 

3.9. If you give up one $5 bill, how many $1 bills do you need to compen¬ 
sate you? Five $1 bills will do nicely. Hence the answer is —5 or —1/5, 
depending on which good you put on the horizontal axis. 

3.10. Zero—if you take away some of good 1, the consumer needs zero units 
of good 2 to compensate him for his loss. 
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3.11. Anchovies and peanut butter, scotch and Kool Aid, and other similar 
repulsive combinations. 


4 Utility 

4.1. The function f(u) = u 2 is a monotonic transformation for positive u, 
but not for negative u. 

4.2. (1) Yes. (2) No (works for v positive). (3) No (works for v negative). 
(4) Yes (only defined for v positive). (5) Yes. (6) No. (7) Yes. (8) No. 

4.3. Suppose that the diagonal intersected a given indifference curve at 
two points, say ( x,x ) and (y 7 y). Then either x > y or y > x, which 
means that one of the bundles has more of both goods. But if preferences 
are monotonic, then one of the bundles would have to be preferred to the 
other. 

4.4. Both represent perfect substitutes. 

4.5. Quasilinear preferences. Yes. 

4.6. The utility function represents Cobb-Douglas preferences. No. Yes. 

4.7. Because the MRS is measured along an indifference curve, and utility 
remains constant along an indifference curve. 


5 Choice 

5.1. X 2 = 0 when p 2 > pi, X 2 = m/p 2 when p 2 < pi, and anything between 
0 and m/p 2 when pi = P 2 - 

5.2. The optimal choices will be X\ = m/pi and X 2 = 0 if p\/p 2 < b, 
Xi = 0 and X2 = m/p2 if P1/P2 > b 7 and any amount on the budget line if 
P1/P2 = b. 

5.3. Let z be the number of cups of coffee the consumer buys. Then we 
know that 2 z is the number of teaspoons of sugar he or she buys. We must 
satisfy the budget constraint 


2piz + p 2 z = TO. 


Solving for z we have 


TO 
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5.4. We know that you’ll either consume all ice cream or all olives. Thus 
the two choices for the optimal consumption bundles will be X\ = m/pi, 
X 2 = 0, or x\ = 0, X 2 = m/p 2 . 

5.5. This is a Cobb-Douglas utility function, so she will spend 4/(1 + 4) = 
4/5 of her income on good 2. 

5.6. For kinked preferences, such as perfect complements, where the change 
in price doesn’t induce any change in demand. 


6 Demand 

6.1. No. If her income increases, and she spends it all, she must be pur¬ 
chasing more of at least one good. 

6.2. The utility function for perfect substitutes is u(x 1 , 2 : 2 ) = X\ + X 2 - 
Thus if u{x 1,2:2) > u{yi,l/ 2 ), we have x\ + X 2 > y 1 + y 2 - It follows that 
tx 1 + tX 2 > tyi + ty 2 , so that u(txi,tx 2 ) > u(tyi,ty 2 )- 

6.3. The Cobb-Douglas utility function has the property that 

u(txi,tx 2 ) = (tei)°(te 2 ) 1_a = t a t 1 ^ a x'l x\~ a = tx^x\~ a = tu{xi,x 2 )- 

Thus if u(x\ , X 2 ) > u(y\,y 2 ), we know that u(tx\,tx 2 ) > u(tyi : ty 2 ), so 
that Cobb-Douglas preferences are indeed homothetic. 

6.4. The demand curve. 

6.5. No. Concave preferences can only give rise to optimal consumption 
bundles that involve zero consumption of one of the goods. 

6.6. Normally they would be complements, at least for non-vegetarians. 

6.7. We know that X\ = m/(p\ + P 2 )- Solving for p\ as a function of the 
other variables, we have 

m 


6.8. False. 


7 Revealed Preference 

7.1. No. This consumer violates the Weak Axiom of Revealed Preference 
since when he bought {x\, X 2 ) he could have bought {y\, 2 / 2 ) and vice versa. 
In symbols: 


P\X\ + P2X2 = lxl + 2x2 = 5>4=lx2 + 2xl = piyi + p 2 y 2 
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and 


9 i 2 /i + 922/2 = 2x2 + lxl = 5>4 = 2xl + lx2 = 91 X 1 + qix-i. 

7.2. Yes. No violations of WARP are present, since the y-bundle is not 
affordable when the x-bundle was purchased and vice versa. 

7.3. Since the y-bundle was more expensive than the x-bundle when the 
x-bundle was purchased and vice versa, there is no way to tell which bundle 
is preferred. 

7.4. If both prices changed by the same amount. Then the base-year bundle 
would still be optimal. 

7.5. Perfect complements. 


8 Slutsky Equation 


8.1. Yes. To see this, use our favorite example of red pencils and blue 
pencils. Suppose red pencils cost 10 cents a piece, and blue pencils cost 
5 cents a piece, and the consumer spends $1 on pencils. She would then 
consume 20 blue pencils. If the price of blue pencils falls to 4 cents a piece, 
she would consume 25 blue pencils, a change which is entirely due to the 
income effect. 

8.2. Yes. 

8.3. Then the income effect would cancel out. All that would be left would 
be the pure substitution effect, which would automatically be negative. 

8.4. They are receiving tx' in revenues and paying out tx, so they are losing 
money. 

8.5. Since their old consumption is affordable, the consumers would have to 
be at least as well-off. This happens because the government is giving them 
back more money than they are losing due to the higher price of gasoline. 


9 Buying and Selling 


9.1. Her gross demands are (9,1). 

9.2. The bundle ( 221 , 2 / 2 ) = (3,5) costs more than the bundle (4,4) at the 
current prices. The consumer will not necessarily prefer consuming this 
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bundle, but would certainly prefer to own it, since she could sell it and 
purchase a bundle that she would prefer. 

9.3. Sure. It depends on whether she was a net buyer or a net seller of the 
good that became more expensive. 

9.4. Yes, but only if the U.S. switched to being a net exporter of oil. 

9.5. The new budget line would shift outward and remain parallel to the 
old one, since the increase in the number of hours in the day is a pure 
endowment effect. 

9.6. The slope will be positive. 


10 Intertemporal Choice 


10.1. According to Table 10.1, $1 20 years from now is worth 3 cents today 
at a 20 percent interest rate. Thus $1 million is worth .03 x 1,000,000 = 
$30, 000 today. 

10.2. The slope of the intertemporal budget constraint is equal to —(l + r). 
Thus as r increases the slope becomes more negative (steeper). 

10.3. If goods are perfect substitutes, then consumers will only purchase the 
cheaper good. In the case of intertemporal food purchases, this implies that 
consumers only buy food in one period, which may not be very realistic. 

10.4. In order to remain a lender after the change in interest rates, the 
consumer must be choosing a point that he could have chosen under the 
old interest rates, but decided not to. Thus the consumer must be worse 
off. If the consumer becomes a borrower after the change, then he is choos¬ 
ing a previously unavailable point that cannot be compared to the initial 
point (since the initial point is no longer available under the new budget 
constraint), and therefore the change in the consumer’s welfare is unknown. 

10.5. At an interest rate of 10%, the present value of $100 is $90.91. At a 
rate of 5% the present value is $95.24. 


11 Asset Markets 


11.1. Asset A must be selling for 11/(1 + .10) = $10. 

11.2. The rate of return is equal to (10,000 + 10,000)/100, 000 = 20%. 


ANSWERS A17 


11.3. We know that the rate of return on the nontaxable bonds, r, must be 
such that (1 — t)r t = r, therefore (1 — .40).10 = .06 = r. 

11.4. The price today must be 40/(1 + .10) 10 = $15.42. 


12 Uncertainty 

12.1. We need a way to reduce consumption in the bad state and increase 
consumption in the good state. To do this you would have to sell insurance 
against the loss rather than buy it. 

12.2. Functions (a) and (c) have the expected utility property (they are 
affine transformations of the functions discussed in the chapter), while (b) 
does not. 

12.3. Since he is risk-averse, he prefers the expected value of the gamble, 
$325, to the gamble itself, and therefore he would take the payment. 

12.4. If the payment is $320 the decision will depend on the form of the 
utility function; we can’t say anything in general. 

12.5. Your picture should show a function that is initially convex, but then 
becomes concave. 

12.6. In order to self-insure, the risks must be independent. However, this 
does not hold in the case of flood damage. If one house in the neighborhood 
is damaged by a flood it is likely that all of the houses will be damaged. 


13 Risky Assets 


13.1. To achieve a standard deviation of 2% you will need to invest x = 
Vx/vm — 2/3 of your wealth in the risky asset. This will result in a rate of 
return equal to (2/3).09 + (1 — 2/3).06 = 8%. 

13.2. The price of risk is equal to ( r m — rf)/a m = (9 — 6)/3 = 1. That 
is, for every additional percent of standard deviation you can gain 1% of 
return. 

13.3. According to the CAPM pricing equation, the stock should offer an 
expected rate of return of rj + (3 (r m — rj) — .05 + 1.5(.10 — .05) = .125 or 
12.5%. The stock should be selling for its expected present value, which is 
equal to 100/1.125 = $88.89. 
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14 Consumer's Surplus 


14.1. The equilibrium price is $10 and the quantity sold is 100 units. If the 
tax is imposed, the price rises to $11, but 100 units of the good will still 
be sold, so there is no deadweight loss. 

14.2. We want to compute the area under the demand curve to the left of 
the quantity 6. Break this up into the area of a triangle with a base of 
6 and a height of 6 and a rectangle with base 6 and height 4. Applying 
the formulas from high school geometry, the triangle has area 18 and the 
rectangle has area 24. Thus gross benefit is 42. 

14.3. When the price is 4, the consumer’s surplus is given by the area of 
a triangle with a base of 6 and a height of 6; i.e., the consumer’s surplus 
is 18. When the price is 6, the triangle has a base of 4 and a height of 4, 
giving an area of 8. Thus the price change has reduced consumer’s surplus 
by $10. 

14.4. Ten dollars. Since the demand for the discrete good hasn’t changed, 
all that has happend is that the consumer has had to reduce his expenditure 
on other goods by ten dollars. 


15 Market Demand 


15.1. The inverse demand curve is P(q) = 200 — 2 q. 

15.2. The decision about whether to consume the drug at all could well 
be price sensitive, so the adjustment of market demand on the extensive 
margin would contribute to the elasticity of the market demand. 

15.3. Revenue is R(p) = 12 p — 2 p 2 , which is maximized at p = 3. 

15.4. Revenue is pD(p) = 100, regardless of the price, so all prices maximize 
revenue. 

15.5. True. The weighted average of the income elasticities must be 1, so 
if one good has a negative income elasticity, the other good must have an 
elasticity greater than 1 to get the average to be 1. 


16 Equilibrium 

16.1. The entire subsidy gets passed along to the consumers if the supply 
curve is flat, but the subsidy is totally received by the producers when the 
supply curve is vertical. 


ANSWERS A19 


16.2. The consumer. 

16.3. In this case the demand curve for red pencils is horizontal at the price 
Pt, since that is the most that they would be willing to pay for a red pencil. 
Thus, if a tax is imposed on red pencils, consumers will end up paying pb 
for them, so the entire amount of the tax will end up being borne by the 
producers (if any red pencils are sold at all—it could be that the tax would 
induce the producer to get out of the red pencil business). 

16.4. Here the supply curve of foreign oil is flat at $25. Thus the price 
to the consumers must rise by the $5 amount of the tax, so that the net 
price to the consumers becomes $30. Since foreign oil and domestic oil 
are perfect substitutes as far as the consumers are concerned, the domestic 
producers will sell their oil for $30 as well and get a windfall gain of $5 per 
barrel. 

16.5. Zero. The deadweight loss measures the value of lost output. Since 
the same amount is supplied before and after the tax, there is no deadweight 
loss. Put another way: the suppliers are paying the entire amount of the 
tax, and everything they pay goes to the government. The amount that 
the suppliers would pay to avoid the tax is simply the tax revenue the 
government receives, so there is no excess burden of the tax. 

16.6. Zero revenue. 

16.7. It raises negative revenue, since in this case we have a net subsidy of 
borrowing. 


17 Auctions 

17.1. Since the collectors likely have their own values for the quilts, and 
don’t particularly care about the other bidders’ values, it is a private-value 
auction. 

17.2. Following the analysis in the text, there are four equally likely config¬ 
urations of bidders: (8,8), (8,10), (10,8), and (10,10). With zero reservation 
price, the optimal bids will be (8,9,9,10), resulting in expected profit of $9. 
The only candidate for a reservation price is $10, which yields expected 
profit of 30/4 = $7.50. Hence zero is a profit-maximizing reservation price 
in this auction. 

17.3. Have each person write down a value, then award the two books to 
the students with the two highest values, but just charge them the bid of 
the third highest student. 
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17.4. It was efficient in the sense that it awarded the license to the firm 
that valued it most highly. But it took a year for this to happen, which is 
inefficient. A Vickrey auction or an English auction would have achieved 
the same result more quickly. 

17.5. This is a common-value auction since the value of the prize is the same 
to all bidders. Normally, the winning bidder overestimates the number of 
pennies in the jar, illustrating the winner’s curse. 


18 Technology 

18.1. Increasing returns to scale. 

18.2. Decreasing returns to scale. 

18.3. If a + b = 1, we have constant returns to scale, a + b < 1 gives 
decreasing returns to scale, and a + b > 1 gives increasing returns to scale. 

18.4. 4 x 3 = 12 units. 

18.5. True. 

18.6. Yes. 


19 Profit Maximization 

19.1. Profits will decrease. 

19.2. Profit would increase, since output would go up more than the cost 
of the inputs. 

19.3. If the firm really had decreasing returns to scale, dividing the scale 
of all inputs by 2 would produce more than half as much output. Thus the 
subdivided firm would make more profits than the big firm. This is one 
argument why having everywhere decreasing returns to scale is implausible. 

19.4. The gardener has ignored opportunity costs. In order to accurately 
account for the true costs, the gardener must include the cost of her own 
time used in the production of the crop, even if no explicit wage was paid. 

19.5. Not in general. For example, consider the case of uncertainty. 

19.6. Increase. 

19.7. The use of x\ does not change, and profits will increase. 

19.8. May not. 
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20 Cost Minimization 


20.1. Since profit is equal to total revenue minus total costs, if a firm is not 
minimizing costs then there exists a way for the firm to increase profits; 
however, this contradicts the fact that the firm is a profit maximizer. 

20.2. Increase the use of factor 1 and decrease the use of factor 2. 

20.3. Since the inputs are identically priced perfect substitutes, the firm 
will be indifferent between which of the inputs it uses. Thus the firm will 
use any amounts of the two inputs such that aq + X 2 = y. 

20.4. The demand for paper either goes down or stays constant. 

20.5. It implies that AuqAaq < 0, where Auq = w\ — w£ and Aaq = 


21 Cost Curves 


21.1. True, true, false. 

21.2. By simultaneously producing more output at the second plant and 
reducing production at the first plant, the firm can reduce costs. 

21.3. False. 


22 Firm Supply 


22.1. The inverse supply curve is p = 20 y, so the supply curve is y = p/20. 

22.2. Set AC = MC to find lOy + 1000/y = 20y. Solve to get y* = 10. 

22.3. Solve for p to get P s (y ) = (y — 100)/20. 

22.4. At 10 the supply is 40 and at 20 the supply is 80. The producer’s 
surplus is composed of a rectangle of area 10 x 40 plus a triangle of area 
' x 10 x 40, which gives a total change in producer’s surplus of 600. This 
is the same as the change in profits, since the fixed costs don’t change. 

22.5. The supply curve is given by y = p/2 for all p > 2, and y = 0 for all 
p < 2. At p = 2 the firm is indifferent between supplying 1 unit of output 
or not supplying it. 
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22.6. Mostly technical (in more advanced models this could be market), 
market, could be either market or technical, technical. 

22.7. That all firms in the industry take the market price as given. 

22.8. The market price. A profit-maximizing firm will set its output such 
that the marginal cost of producing the last unit of output is equal to its 
marginal revenue, which in the case of pure competition is equal to the 
market price. 

22.9. The firm should produce zero output (with or without fixed costs). 

22.10. In the short run, if the market price is greater than the average 
variable cost, a firm should produce some output even though it is losing 
money. This is true because the firm would have lost more had it not 
produced since it must still pay fixed costs. However, in the long run there 
are no fixed costs, and therefore any firm that is losing money can produce 
zero output and lose a maximum of zero dollars. 

22.11. The market price must be equal to the marginal cost of production 
for all firms in the industry. 


23 Industry Supply 


23.1. The inverse supply curves are P\(y\) = 10 + yi and ^ 2 ( 3 / 2 ) = 15 + J/ 2 - 
When the price is below 10 neither firm supplies output. When the price 
is 15 firm 2 will enter the market, and at any price above 15, both firms 
are in the market. Thus the kink occurs at a price of 15. 

23.2. In the short run, the consumers pay the entire amount of the tax. In 
the long run it is paid by the producers. 

23.3. False. A better statement would be: convenience stores can charge 
high prices because they are near the campus. Because of the high prices 
the stores are able to charge, the landowners can in turn charge high rents 
for the use of the convenient location. 

23.4. True. 

23.5. The profits or losses of the firms that are currently operating in the 
industry. 

23.6. Flatter. 

23.7. No, it does not violate the model. In accounting for the costs we 
failed to value the rent on the license. 
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24 Monopoly 

24.1. No. A profit-maximizing monopolist would never operate where the 
demand for its product was inelastic. 

24.2. First solve for the inverse demand curve to get p(y) = 50 — y/2. 
Thus the marginal revenue is given by MR(y) = 50 — y. Set this equal 
to marginal cost of 2, and solve to get y = 48. To determine the price, 
substitute into the inverse demand function, p(48) = 50 — 48/2 = 26. 

24.3. The demand curve has a constant elasticity of —3. Using the formula 
p[l + 1/e] = MC, we substitute to get p[ 1 — 1/3] = 2. Solving, we get 
p = 3. Substitute back into the demand function to get the quantity 
produced: D( 3) = 10 x 3 -3 . 

24.4. The demand curve has a constant elasticity of —1. Thus marginal 
revenue is zero for all levels of output. Hence it can never be equal to 
marginal cost. 

24.5. For a linear demand curve the price rises by half the change in cost. 
In this case, the answer is $3. 

24.6. In this case p = kMC , where k = 1/(1 — 1/3) = 3/2. Thus the price 
rises by $9. 

24.7. Price will be two times marginal cost. 

24.8. A subsidy of 50 percent, so the marginal costs facing the monopo¬ 
list are half the actual marginal costs. This will ensure that price equals 
marginal cost at the monopolist’s choice of output. 

24.9. A monopolist operates where p(y) +yAp/Ay = MC(y). Rearranging, 
we have p{y) = MC(y) — yAp/Ay. Since demand curves have a negative 
slope, we know that Ap/Ay < 0, which proves that p(y) > MC(y). 

24.10. False. Imposing a tax on a monopolist may cause the market price 
to rise more than, the same as, or less than the amount of the tax. 

24.11. A number of problems arise, including: determining the true mar¬ 
ginal costs for the firm, making sure that all customers will be served, and 
ensuring that the monopolist will not make a loss at the new price and 
output level. 

24.12. Some appropriate conditions are: large fixed costs and small mar¬ 
ginal costs, large minimum efficient scale relative to the market, ease of 
collusion, etc. 
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25 Monopoly Behavior 


25.1. Yes, if it can perfectly price discriminate. 

25.2. pi = ejc/(1 + Ej) for i = 1,2. 

25.3. If he can perfectly price discriminate, he can extract the entire con¬ 
sumers’ surplus; if he can charge for admission, he can do the same. Hence, 
the monopolist does equally well under either pricing policy. (In practice, 
it is much easier to charge for admission than to charge a different price 
for every ride.) 

25.4. This is third-degree price discrimination. Apparently the Disneyland 
administrators believe that residents of Southern California have more elas¬ 
tic demands than other visitors to their park. 


26 Factor Markets 


26.1. Sure. A monopsonist can produce at any level of supply elasticity. 

26.2. Since the demand for labor would exceed the supply at such a wage, 
we would presumably see unemployment. 

26.3. We find the equilibrium prices by substituting into the demand func¬ 
tions. Since p = a — by , we can use the solution for y to find 

3a + c 
P= —- 

Since k = a — 2 bx, we can use the solution for x to find 

, a + c 


27 Oligopoly 

27.1. In equilibrium each firm will produce (a — c)/36, so the total industry 
output is 2 (a — c)/3 b. 

27.2. Nothing. Since all firms have the same marginal cost, it doesn’t 
matter which of them produces the output. 
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27.3. No, because one of the choices open to the Stackelberg leader is to 
choose the level of output it would have in the Cournot equilibrium. So it 
always has to be able to do at least this well. 

27.4. We know from the text that we must have p[ 1 — l/7T.|e|] = MC. Since 
MC > 0, and p > 0, we must have 1 — l/n|e| > 0. Rearranging this 
inequality gives the result. 

27.5. Make f 2 {yi) steeper than /i( 2 / 2 )- 

27.6. In general, no. Only in the case of the Bertrand solution does price 
equal the marginal cost. 


28 Game Theory 


28.1. The second player will defect in response to the first player’s (mis¬ 
taken) defection. But then the first player will defect in response to that, 
and each player will continue to defect in response to the other’s defection! 
This example shows that tit-for-tat may not be a very good strategy when 
players can make mistakes in either their actions or their perceptions of the 
other player’s actions. 

28.2. Yes and no. A player prefers to play a dominant strategy regardless of 
the strategy of the opponent (even if the opponent plays her own dominant 
strategy). Thus, if all of the players are using dominant strategies then it 
is the case that they are all playing a strategy that is optimal given the 
strategy of their opponents, and therefore a Nash equilibrium exists. How¬ 
ever, not all Nash equilibria are dominant strategy equilibria; for example, 
see Table 28.2. 

28.3. Not necessarily. We know that your Nash equilibrium strategy is 
the best thing for you to do as long as your opponent is playing her Nash 
equilibrium strategy, but if she is not then perhaps there is a better strategy 
for you to pursue. 

28.4. Formally, if the prisoners are allowed to retaliate the payoffs in the 
game may change. This could result in a Pareto efficient outcome for 
the game (for example, think of the case where the prisoners both agree 
that they will kill anyone who confesses, and assume death has a very low 
utility). 

28.5. The dominant Nash equilibrium strategy is to defect in every round. 
This strategy is derived via the same backward induction process that was 
used to derive the finite 10-round case. The experimental evidence using 
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much smaller time periods seems to indicate that players rarely use this 
strategy. 

28.6. The equilibrium has player B choosing left and player A choosing top. 
Player B prefers to move first since that results in a payoff of 9 versus a 
payoff of 1. (Note, however, that moving first is not always advantageous 
in a sequential game. Can you think of an example?) 


29 Game Applications 


29.1. In a Nash equilibrium, each player is making a best response to the 
other player’s best response. In a dominant strategy equilibrium, each 
player’s choice is a best response to any choice the other player makes. 

29.2. No, because when r = 1/3 there is an infinity of best responses, not 
a single one, as is required for the mathematical definition of a function. 

29.3. Not necessarily; it depends on the payoffs of the game. In chicken if 
both choose to drive straight they receive the worst payoff. 

29.4. It is row’s expected payoff in the equilibrium strategy of kicking to 
the left with probability .7, while column jumps to the left with probability 
.6. We have to sum the payoffs to row over four events: the probability row 
kicks left and column defends left x row’s payoff in this case + probability 
row kicks right and column defends left x row’s payoff in this case, and so 
on. The numbers are (,7)(.6)50 + (,7)(.4)80 + (.3)(.6)90 + (,3)(.4)20 = 62. 

29.5. He means that he will bid low in order to get the contract, but then 
charge high prices subsequently for any changes. The client has to go along, 
since it is costly for him to switch in the middle of a job. 


30 Behavioral Economics 


30.1. The first group is more likely to buy, due to the “framing effect.” 

30.2. The “bracketing effect” makes it likely that the meals chosen by Mary 
will have more variety. 

30.3. From the viewpoint of classical consumer theory, more choice is bet¬ 
ter. But it is certainly possible that too much choice could confuse the 
employees, so 10 might be a safer choice. If you did decide to offer 50 mu¬ 
tual funds, it would be a good idea to group them into a relatively small 
number of categories. 
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30.4. The probability of heads coming up 3 times in a row is \ x \ x \ = 
| = .125. The probability of tails coming up in a row is also .125, so the 
probability of a run of 3 heads or tails is .25. 

30.5. It is called “time inconsistency.” 


31 Exchange 

31.1. Yes. For example, consider the allocation where one person has every¬ 
thing. Then the other person is worse off at this allocation than he would 
be at an allocation where he had something. 

31.2. No. For this would mean that at the allegedly Pareto efficient al¬ 
location there is some way to make everyone better off, contradicting the 
assumption of Pareto efficiency. 

31.3. If we know the contract curve, then any trading should end up some¬ 
where on the curve; however, we don’t know where. 

31.4. Yes, but not without making someone else worse off. 

31.5. The value of excess demand in the remaining two markets must sum 
to zero. 


32 Production 

32.1. Giving up 1 coconut frees up $6 worth of resources that could be used 
to produce 2 pounds (equals $6 worth) of fish. 

32.2. A higher wage would produce a steeper isoprofit line, implying that 
the profit maximizing level for the firm would occur at a point to the left of 
the current equilibrium, entailing a lower level of labor demand. However, 
under this new budget constraint Robinson will want to supply more than 
the required level of labor (why?) and therefore the labor market will not 
be in equilibrium. 

32.3. Given a few assumptions, an economy that is in competitive equilib¬ 
rium is Pareto efficient. It is generally recognized that this is a good thing 
for a society since it implies that there are no opportunities to make any in¬ 
dividual in the economy better off without hurting someone else. However, 
it may be that the society would prefer a different distribution of welfare; 
that is, it may be that society prefers making one group better off at the 
expense of another group. 

32.4. He should produce more fish. His marginal rate of substitution indi¬ 
cates that he is willing to give up two coconuts for an additional fish. The 
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marginal rate of transformation implies that he only has to give up one 
coconut to get an additional fish. Therefore, by giving up a single coconut 
(even though he would have been willing to give up two) he can have an 
additional fish. 

32.5. Both would have to work 9 hours per day. If they both work for 
6 hours per day (Robinson producing coconuts, and Friday catching fish) 
and give half of their total production to the other, they can produce the 
same output. The reduction in the hours of work from 9 to 6 hours per day 
is due to rearranging production based on each individual’s comparative 
advantage. 


33 Welfare 

33.1. The major shortcoming is that there are many allocations that cannot 
be compared—there is no way to decide between any two Pareto efficient 
allocations. 

33.2. It would have the form: W(u±, ..., u n ) = max{it 1; ..., u n }. 

33.3. Since the Nietzschean welfare function cares only about the best off 
individual, welfare maxima for this allocation would typically involve one 
person getting everything. 

33.4. Suppose that this is not the case. Then each individual envies someone 
else. Let’s construct a list of who envies whom. Person A envies someone— 
call him person B. Person B in turn envies someone—say person C. And 
so on. But eventually we will find someone who envies someone who came 
earlier in the list. Suppose the cycle is “C envies D envies E envies C.” 
Then consider the following swap: C gets what D has, D gets what E has, 
and E gets what C has. Each person in the cycle gets a bundle that he 
prefers, and thus each person is made better off. But then the original 
allocation couldn’t have been Pareto efficient! 

33.5. First vote between x and z, and then vote between the winner (z) 
and y. First pair x and y, and then vote between the winner (x) and z. 
The fact that the social preferences are intransitive is responsible for this 
agenda-setting power. 


34 Externalities 

34.1. True. Usually, efficiency problems can be eliminated by the delin¬ 
eation of property rights. However, when we impose property rights we 
are also imposing an endowment, which may have important distributional 
consequences. 
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34.2. False. 

34.3. Come on, your roommates aren’t all bad ... 

34.4. The government could just give away the optimal number of grazing 
rights. Another alternative would be to sell the grazing rights. (Question: 
how much would these rights sell for? Hint: think about rents.) The 
government could also impose a tax, t per cow, such that /(c*)/c* +1 = a. 


35 Information Technology 


35.1. They should be willing to pay up to $50, since this is the present value 
of the profit they can hope to get from that customer in the long run. 

35.2. Users would gravitate toward packages with the most users, since that 
would make it more convenient for them to exchange files and information 
about how to use the program. 

35.3. In this case the profit maximization conditions are identical. If two 
people share a video, the producer would just double the price and make 
exactly the same profits. 


36 Public Goods 

36.1. We want the sum of the marginal rates of substitution to equal the 
marginal cost of providing the public good. The sum of the MRSs is 20 
(= 10 x 2), and the marginal cost is 2x. Thus we have the equation 2x = 20, 
which implies that x = 10. So the Pareto efficient number of streetlights is 
10 . 


37 Asymmetric Information 


37.1. Since only the low-quality cars get exchanged in equilibrium and there 
is a surplus of $200 per transaction, the total surplus created is 50 x 200 = 
$ 10 , 000 . 

37.2. If the cars were assigned randomly, the average surplus per trans¬ 
action would be the average willingness to pay, $1800, minus the average 
willingness to sell, $1500. This gives an average surplus of $300 per trans¬ 
action and there are 100 transactions, so we get a total surplus of $30,000, 
which is much better than the market solution. 
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37.3. We know from the text that the optimal incentive plan takes the 
form s(x) = wx + K. The wage w must equal the marginal product of 
the worker, which in this case is 1. The constant K is chosen so that the 
worker’s utility at the optimal choice is u = 0. The optimal choice of x 
occurs where price, 1, equals marginal cost, x, so x* = 1. At this point the 
worker gets a utility of x* + K — c(x*) = 1 + K ~ 1/2 = 1/2 + K. Since 
the worker’s utility must equal 0, it follows that K = —1/2. 

37.4. We saw in the last answer that the profits at the optimal level of 
production are 1/2. Since u = 0, the worker would be willing to pay 1/2 
to rent the technology. 

37.5. If the worker is to achieve a utility level of 1, the firm would have to 
give the worker a lump-sum payment of 1/2. 
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